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Diffused Emitter and Base Silicon 
Transistors* 


By M. TANENBAUM and D. E. THOMAS 
(Manuscript received October 21, 1955) 


Silicon n-p-n transistors have been made in which the base and emitter 
regions were produced by diffusing impurities from the vapor phase. Tran- 
sistors with base layers 3.8 X 10°*-cm thick have been made. The diffusion 
techniques and the processes for making electrical contact to the structures 
are described. 

The electrical characteristics of a transistor with a maximum alpha of 
0.97 and an alpha-cutoff of 120 mc/sec are presented. The manner in which 
some of the electrical parameters are determined by the distribution of the 
doping impurities 2s discussed. Design data for the diffused emitter, dif- 
fused base structure ts calculated and compared with the measured char- 
acteristics. 


INTRODUCTION 

The necessity of thin base layers for high-frequency operation of tran- 
sistors has long been apparent. One of the most appealing techniques for 
controlling the distribution of impurities in a semiconductor is the dif- 
fusion of the impurity into the solid semiconductor. The diffusion co- 
efficients of Group III acceptors and Group V donors into germanium 
and silicon are sufficiently low at judiciously selected temperatures so 

* A portion of the material of this paper was presented at the Semiconductor 


Device Conference of the Institute of Radio Engineers, Philadelphia, Pa., June, 
1955. 
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that it is possible to envision transistors with base layer thicknesses of a 
micron and frequency response of several thousand megacycles per 
second. 

A major deterent to the application of diffusion to silicon transistor 
fabrication in the past was the drastic decrease in lifetime which generally 
occurs when silicon is heated to the high temperatures required for dif- 
fusion. There was also insufficient knowledge of the diffusion parameters 
to permit the preparation of structures with controlled layer thicknesses 
and desired dopings. Recently the investigations of C. S. Fuller and co- 
workers have produced detailed information concerning the diffusion of 
Group ITI and Group V elements in silicon. This information has made 
possible the controlled fabrication of transistors with base layers suffi- 
ciently thin that high alphas are obtained even though the lifetime has 
been reduced to a fraction of a microsecond. In a cooperative program | 
with Fuller, diffusion structures were produced which have permitted 
the fabrication of transistors whose electrical behavior closely approxi- 
mates the behavior anticipated from the design. This paper describes 
these techniques which have resulted in high alpha silicon transistors 
with alpha-cutoff of over 100 mc/sec. 


1.0 FABRICATION OF THE TRANSISTORS 


Fuller’s work’ has shown that in silicon the diffusion coefficient of a 
Group III acceptor is usually 10 to 100 times larger than that of the 
Group V donor in the same row in the periodic table at the same tem- 
peratures. These experiments were performed in evacuated silica tubes 
using the Group III and Group V elements as the source of diffusant. 
Under these conditions a particular steady state surface concentration 
of the diffusant is produced and the depth of diffusion is sensitive to 
this concentration as well as to the diffusion coefficient. The experiments 
show that the effective steady state surface concentration of the donor 
impurities produced under these conditions is ten to one hundred times 
greater than that of the acceptor impurities. Thus, by the simultaneous 
diffusion of selected donor and acceptor impurities into n-type silicon 
an n-p-n structure will result. The first n-layer forms because the surface 
concentration of the donor is greater than that of the acceptor. The 
p-layer is produced because the acceptor diffuses faster than the donor 
and gets ahead of it. The final n-region is simply the original background 
doping of the n-type silicon sample. It has been possible to produce n-p-n 
structures by the simultaneous diffusion of several combinations of 
donors and acceptors. Often, however, the diffusion coefficients and — 
surface concentrations of the donors and acceptors are such that opti- 


1C. 8. Fuller, private communication. 
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mum layer thicknesses (see Sections 3 and 4) are not produced by simul- 
taneous diffusion. In this case, one of the impurities is started ahead of 
the other in a prior diffusion, and then the other impurity is diffused 
in a second operation. 

With the proper choice of diffusion temperatures and times it has been 
possible to make n-p-n structures with base layer thicknesses of 2 X 10° 
em. The uniformity of the layers in a given specimen is better than ten 
per cent of the layer thickness. Fig. 1 illustrates the uniformity of the 
layers. This figure is an enlarged photograph of a view perpendicular 
to the surface of the specimen. A bevel which makes an angle of five 
degrees with the original surface has been polished on the specimen. This 
angle magnifies the layer thickness by 11.5. The layer is defined by an 
etchant which preferentially stains p-type silicon’ and the width of the 
layer is measured with a calibrated microscope. 

After diffusion the entire surface of the silicon wafer is covered with 
the diffused n- and p-type layers, see Fig. 2(a). Electrical contact must 
now be made to the three regions of the device. The base contact can 
be made by polishing a bevel on the specimen to expose and magnify 
the base layer and then alloying a lead to this region by the same tech- 





t———-14 EQUIVALENT TO 2X 10°4CM 
LAYER THICKNESS 


Fig. 1 — Angle section of a double diffused silicon wafer. The p-type center 
layer is approximately 2 X 10-4 em thick. 
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niques employed in the fabrication of grown junction transistors, Fig. 
2(b). However, a much simpler technique has been evolved. If the sur- 
face concentration of the donor diffusant is maintained below a certain 
critical value, it is possible to alloy an aluminum wire directly through 
the diffused n-type layer and thus make effective contact to the base 
layer, Fig. 2(c). Since the resistivity of the original silicon wafer is one 
to five ohm-cm, the aluminum will be rectifying to this region. It has 
been experimentally shown that if the surface concentration of the 
donor diffusant is less than the critical value mentioned above, the 
aluminum will also be rectifying to the diffused n-type region and the 
contact becomes merely an extension of the base layer. The n-layers 
produced by diffusing from elemental antimony are below the critical 
concentration and the direct aluminum alloying technique is feasible. 


Paes N+TYPE DIFFUSED LAYER 


N-TYPE (ORIGINAL WAFER) 





\ 
(a) \— p-TYPE DIFFUSED LAYER 






n+ =a £7 ALUMINUM WIRE 


P+ALUMINUM DOPED 
~ REGROWTH LAYER 


p (b) 
_-- ALUMINUM WIRE 


at P+ALUMINUM DOPED 
, REGROWTH LAYER 
xm 


pe (c) 


Fig. 2 — Schematic illustration of (a) double diffused n-p-n wafer, (b) angle 
ee method of making base contact, and (c) direct alloying method of making 
ase contact. 
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Fig. 3 — Mounted double diffused transistor. 


Contact to the emitter layer is achieved by alloying a film of gold 
containing a small amount of antimony. Since this alloy will produce 
an n-type regrowth layer, it is only necessary to insure that the gold- 
antimony film does not alloy through the p-type base layer, thus shorting 
the emitter to the collector. This is controlled by limiting the amount of 
gold-antimony alloy which is available by using a thin evaporated film 
or by electroplating a thin film of gold-antimony alloy on an inert metal 
point and alloying this structure to the emitter layer. 

Ohmic contact to the collector is produced by alloying the silicon 
wafer to an inert metal tab plated with a gold-antimony alloy. 
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The transistors whose characteristics are reported in this paper were 
prepared from 3 ohm-cm n-type silicon using antimony and aluminum 
as the diffusants. The base contact was produced by evaporating alumi- 
num through a mask so that a line approximately 0.005 x 0.015 cm in 
lateral dimensions and 100,000 A thick was formed on the surface. This 
aluminum ‘line was alloyed through the emitter layer in a subsequent 
operation. The wafer was then alloyed onto the plated kovar tab. A 
small area approximately 0.015 cm in diameter was masked around the 
line and the wafer was etched to remove the unwanted layers. The unit 
was then mounted in a header. Electrical contact to the collector was 
made by soldering to the kovar tab. Contact to the base was made with 
a tungsten point pressure contact to the alloyed aluminum. Contact 
to the emitter was made by bringing a gold-antimony plated tungsten 
point into pressure contact with the emitter layer. The gold-antimony 
plate was then alloyed by passing a controlled electrical pulse between 
the plated point and the transistor collector lead. Fig. 3 is a photograph 
of a mounted unit. 


2.0 ELECTRICAL CHARACTERISTICS 


The frequency cutoffs of experimental double diffused silicon tran- 
sistors fabricated as described above are an order of magnitude higher 
than the known cutoff frequencies of earlier silicon transistors. This is 
shown in Fig. 4 which gives the measured common base and common 
emitter current gains for one of these units as a function of frequency. 
The common base short-circuit current gain is seen to have a cutoff fre- 
quency of about 120 mc/sec.” The common emitter short-circuit current 
gain is shown on the same figure. The low-frequency current gain is 
better than thirty decibels and the cutoff frequency which is indicated 
by the frequency at which the gain is 3 db below its low-frequency 
value is 3 mc/sec. This is an exceptionally large common emitter band- 
width for a thirty db common emitter current gain and is of the same 
order of magnitude as that obtained with the highest frequency ger- 
manium transistors (e.q., p-n-i-p or tetrode) which had been made 
prior to the diffused base germanium transistor.° 


? The increase in common base current gain above unity (indicated by current 
gain in decibels being positive) in the vicinity of 50 mc/sec is caused by a reactance 
gain error in the common base measurement. This error is caused by a combination 
of the emitter to ground parasitic capacitance and the positive reactance com- 
ponent of the transistor input impedance resulting from phase shift in the alpha 
current gain. 

3C. A. Lee, A High-Frequency, Diffused Base Germanium Transistor, see 
page 23. 
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Fig. 4 — Short-circuit current gain of a double diffused silicon n-p-n transistor 
as a function of frequency in the common emitter and common base connections. 


Fig. 5 shows a high-frequency lumped constant equivalent circuit 
for the double diffused silicon transistor whose current gain cutoff char- 
acteristic is shown in Fig. 4. External parasitic capacitances have been 
omitted from the circuit. The configuration is the conventional one for 
junction transistors with two exceptions. A series resistance 7,’ has been 
added in the emitter circuit to account for contact resistance resulting 
from the fact that the present emitter point contacts are not perfectly 
ohmic. A second resistance 7,’ has been added in the collector circuit to 
account for the ohmic resistance of the n-type silicon between the col- 
lector terminal and the effective collector junction. This resistance exists 
in all junction transistors but in larger area low frequency junction 
transistors its effect on alpha-cutoff is sufficiently small so that it has 
been ignored in equivalent circuits of these devices. The collector RC 


Ce = 7uur 





Cc= O.S5244F re = 150 @ 


(~); ois 
a le 


1 


COMMON BASE CURRENT = 120MC 
GAIN CUT-OFF FREQUENCY 


Ig =3MA 
Ve = 10 VOLTS 


Fig. 5 — High-frequency lumped constant equivalent circuit for a double 
diffused silicon n-p-n transistor. 
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cutoff caused by the collector capacitance and the combined collector 
body resistance and base resistance is an order of magnitude higher 
than the measured alpha cutoff frequency and therefore is not too serious 
in impairing the very high-frequency performance of the transistor. 
This is due to the low capacitance of the collector junction which is 
seen to be approximately 0.5 mmf at 10 volts collector voltage. The 
base resistance of this transistor is less than 100 ohms which is quite low 
and compares very favorably with the best low frequency transistors 
reported previously. 

The low-frequency characteristics of the double diffused silicon tran- 
sistor are very similar to those of other junction transistors. This is il- 
lustrated in Fig. 6 where the static collector characteristics of one of 
these transistors are given. At zero emitter current the collector current 
is too small to be seen on the scale of this figure. The collector current 











CURRENT, I,, IN MILLIAMPERES 


Fig. 6 — Collector characteristics of a double diffused silicon n-p-n tran- 
sistor. 
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CURRENT, Ie, IN MILLIAMPERES 


Fig. 7 — Alpha as a function of emitter current and temperature for a double 
diffused silicon n-p-n transistor. 


under this condition does not truly saturate but collector junction re- 
sistance is very high. Collector junction resistances of 50 megohms at 
reverse biases of 50 volts are common. 

The continuous power dissipation permissible with these units is also 
shown in Fig. 6. The figure shows dissipation of 200 milliwatts and the 
units have been operated at 400 milliwatts without damage. As illus- 
trated in Fig. 3 no special provision has been made for power dissipation 
and it would appear from the performance obtained to date that powers 
of a few watts could be handled by these units with relatively minor 
provisions for heat dissipation. However, it can also be seen from Fig. 6 
that at low collector voltages alpha decreases rapidly as the emitter 
current is increased. The transistor is, therefore, non-linear in this 
range of emitter currents and collector voltages. In many applications, 
this non-linearity may limit the operating range of the device to values 
below those which would be permissible from the point of view of con- 
tinuous power dissipation. 

Fig. 7 gives the magnitude of alpha as a function of emitter current 
for a fixed collector voltage of 10 volts and a number of ambient tem- 
peratures. These curves are presented to illustrate the stability of the 
parameters of the double diffused silicon transistor at increased ambient 
temperatures. Over the range from 1 to 15 milliamperes emitter current 
and 25°C to 150°C ambient temperature, alpha is seen to change only 
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by approximately 2 per cent. This amounts to only 150 parts per million 
change in alpha per degree centigrade change in ambient temperature. 

The decrease in alpha at low emitter currents shown in Fig. 7 has been 
observed in every double diffused silicon transistor which has been made 
to date. Although this effect is not completely understood at present it 
could be caused by recombination centers in the base layer that can 
be saturated at high injection levels. Such saturation would result in an 
increase in effective lifetime and a corresponding increase in alpha. The 
large increase in alpha with temperature at low emitter currents is con- 
sistent with this proposal. It has also been observed that shining a strong 
light on the transistor will produce an appreciable increase in alpha at 
low emitter currents but has little effect at high emitter currents. A 
strong light would also be expected to saturate recombination centers 
which are active at low emitter currents and this behavior is also con- 
sistent with the above proposal. 


3.0 DISCUSSION OF THE TRANSISTOR STRUCTURE 


Although the low frequency electrical characteristics of the double 
diffused silicon transistor which are presented in Section 2 are quite 
similar to those usually obtained in junction transistors, the structure 
of the double diffused transistor is sufficiently different from that of the 
grown junction or alloy transistor that a discussion of some design 
principles is warranted. This section is devoted to a general discussion 
of the factors which determine the electrical characteristics of the tran- 
sistors. In Section 4 the general ideas of Section 3 are applied in a more 
specialized fashion to the double diffused structure and a detailed cal- 
culation of electrical parameters is presented. 

One essential difference between the double diffused transistor and 
grown junction or alloy transistors arises from the manner in which the 
impurities are distributed in the three active regions. In the ideal case 
of a double-doped grown junction transistor or an alloy transistor the 
concentration of impurities in a given region is essentially uniform and 
the transition from one conductivity type to another at the emitter and 
collector junctions is abrupt giving rise to step junctions. On the other 
hand in the double diffused structure the distribution of impurities is 
more closely described by the error function complement and the emitter 
and collector junctions are graded. These differences can have an appre- 
ciable influence on the electrical behavior of the transistors. 

Fig. 8(a) shows the probable distribution of donor impurities, Np , 
and acceptor impurities, V4, in a double diffused n-p-n. Fig. 8(b) is a 
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CONCENTRATION 
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Fig. 8 — Diagrammatic representation of (a) donor and acceptor distributions 
and (b) uncompensated impurity distribution in a double diffused n-p-n tran- 
sistor. 


plot of N» — N,4 which would result from the distribution in Fig. 8(a). 
Krémer* has shown that a nonuniform distribution of impurities in a 
semiconductor will produce electric fields which can influence the flow 
of electrons and holes. For example, in the base region the fields between 
the emitter junction, x, , and the minimum in the Np — Ny, curve, 7’, 
will retard the flow of electrons toward the collector while the fields 
between this minimum and the collector junction, x, , will accelerate the 
flow of electrons toward the collector. These base layer fields will affect 
the transit time of minority carriers across the base and thus contribute 


4H. Krémer, On Diffusion and Drift Transistor Theory I, II, III, Archiv. der 
Electr. Ubertragung, 8, pp. 223-228, pp. 363-369, pp. 499-504, 1954. 
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to the frequency response of the transistor. In addition the base re- 
sistance will be dependent on the distribution of both diffusants. These 
three factors are discussed in detail below. 

Moll and Ross’ have determined that the minority current, J,, , that 
will flow into the base region of a transistor if the base is doped in a non- 
uniform manner is given by 


2 
ni qDm eivikt 


ao [x@ dx 


(3.1) 


where n; is the carrier concentration in intrinsic material, q is the elec- 
tronic charge, V is the applied voltage, D,, is the diffusion coefficient of 
the minority carriers, and the integral represents the total number of 
uncompensated impurities in the base. The primary assumptions in this 
derivation are (1) planar junctions, (2) no recombination in the base 
region, and (8) a boundary condition at the collector junction that the 
minority carrier density at this point equals zero. It is also assumed that 
the minority carrier concentration in the base region just adjacent to the 
emitter junction is equal to the equilibrium minority carrier density at 
this point multiplied by the Boltzman factor exp (qV /kT). It is of special 
interest to note that J,, depends only on the total number of uncom- 
pensated impurities in the base and not on the manner in which they 
are distributed. 

In the double diffused transistor, it has been convenient from the 
point of ease of fabrication to make the emitter layer approximately the 
same thickness as the base layer. It has been observed that heating sili- 
con to high temperatures degrades the lifetime of n- and p-type silicon 
in a similar manner.” Both base and emitter layers have experienced the 
same heat treatment and to a first approximation it can be assumed that 
the lifetime in the two regions will be essentially the same. Thus as- 
sumptions (1) and (2) should also apply to current flow from base to 
emitter. If we assume that the surface recombination velocity at the 
free surface of the emitter is infinite, then this imposes a boundary 
condition at this side of the emitter which under conditions of forward 
bias on the emitter is equivalent to assumption (3). Thus an equation 
of the form of (3.1) should also give the minority current flow from base 
to emitter. Since the emitter efficiency, y, is given by 


5 J. L. Moll and I. M. Ross, The Dependence of Transistor Parameters on the 
Distribution of Base Layer Resistivity, Proc. I.R.E. in press. 
6G, Bemski, private communication, 
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In(emitter to base) 


a In(emitter to base) + Jn(base to emitter) 


proper substitution of (3.1) will give the emitter efficiency of the double 
diffused n-p-n transistor, 


1 
e [ Gra = Np) de 
pb 
i 
" { (Wo — Na) de 


Y= 
(3.2) 


In (3.2), D, is the diffusion coefficient of holes in the emitter, D, is the 
diffusion coefficient of electrons in the base and the. ratio of integrals is 
the ratio of total uncompensated doping in the base to that in the 
emitter. | 

A calculation of transit time is more difficult. Krémer* has studied 
the case of an aiding field which reduces transit time of minority carriers 
across the base region and thus increases frequency response. In the 
double diffused transistor the situation is more complex. Near the 
emitter side of the base region the field is retarding (Region R, see Fig. 8) 
and becomes aiding (Region A) only after the base region doping reaches 
a maximum. The case of retarding fields has been studied by Lee® and 
by Moll.’ At present, the case for a base region containing both types of 
fields has not been solved. However, at the present state of knowledge 
some speculations about transit time can be made. 

The two factors of primary importance are the magnitude of the 
built-in fields and the distance over which they extend. In the double 
diffused transistor, the widths of regions R and A are determined by the 
surface concentrations and diffusion coefficients of the diffusants. It 
can be shown by numerical computation’ that if region R constitutes no 
more than 30-40 per cent of the entire base layer width, then the overall 
effect of the built-in fields will be to aid the transport of minority car- - 
riers and to produce a reduction in transit time. In addition the absolute 
magnitude of region R is important. If the point x’ should occur within 
an effective Debye length from the emitter junction, 1.e., if x’ is located 
in the space charge region associated with the emitter junction, then the 
retarding fields can be neglected. 

The base resistance can also be calculated from surface concentrations 
and diffusion coefficients of the impurities. This is done by considering 
the base layer as a conducting sheet and determining the sheet con- 


7 J. L. Moll, private communication. 
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ductivity from the total number of uncompensated impurities per square 
centimeter of sheet and the appropriate mobility weighted to account 
for impurity scattering. 


4.0 CALCULATION OF DESIGN PARAMETERS 


To calculate the parameters which determine emitter efficiency, transit 
time, and base resistance it is assumed that the distribution of uncom- 
pensated impurities is given by 


N(x) = Myerfe = — Noerfe = of N3 (4.1) 


where N, and N» are the surface concentrations of the emitter and base 
impurity diffusants respectively, LZ: and Le are their respective diffusion 
lengths, and N3 is the original doping of the semiconductor into which 
the impurities are diffused. The impurity diffusion lengths are defined as 


In =2~VJSDit and Le = 2-~WDep (4.2) 


where the D’s are the respective diffusion coefficients and the ?’s are the 
diffusion times. 
Hquation (4.1) can be reduced ‘to 


r(é) = Tierfe & — Teerfe AE + 1 (4.3) 

where 
at Ni. _ Ne, _ & , _ th 
ES yg ae ra Seg eT 


For cases of interest here, ['() will be zero at two points, a and 8, 
and will have one minimum at &’. In the transistor structure the emitter 
junction occurs at € = « and the collector junction occurs at & = 8. 
Thus the base width is determined by 6 — a. The extent of aiding and 
retarding fields in the base is determined by ¢’. The integral of (4.3) 
from 0 toa, J; , and from a to 8, 2, are the integrals of interest in (3.2) 
and thus determine emitter efficiency. In addition J, is the integral from 
which base resistance can be calculated. 

The calculations which follow apply only for values of T,/T. and I, 
greater than ten. Some of the simplifying assumptions which are made 
will not apply at lower values of these parameters where the distribution 
of both diffusants as well as the background doping affect the structure 
in all three regions of the device. 
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4.1 Base Width a 


a 


From Fig. 8 and (4.3) it can be seen that for Tz = 10, a is essentially 
independent of I; and is primarily a function of T',/T2 and X. Fig. 9 is a 
plot of a versus Ti/T'2 with \ as the parameter. The particular plot is for 

= 10°. Although as stated a is essentially independent of I’: , at lower 
values of I's, a may not exist for the larger values of X, i.e., the p-layer 
does not form. 

In the same manner, it can be seen that 8 is essentially independent of 
I';/T2 and is a function only of Tl, and X. Fig. 10 is a plot of 6 versus I, 
with \ as a parameter. This plot is for T:/T, = 10 and at larger T/T, , 
6 may not exist at large X. 










an//au 
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Fig. 9 — Emitter layer thickness (in reduced units) as a function of the ratio 
of the surface concentrations of the diffusing impurities (T:/T'2) and the ratio of 
their diffusion lengths (A). 
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The base width n 
w=B-a 


can be obtained from Figs. 9 and 10. a, 8 and w can be converted to 
centimeters by multiplying by the appropriate value of Z,. 


4.2 Emitter Efficiency 


With the limits a and 8 determined above, the integrals J; and J: can 
be calculated. Examination of the integrals shows that J; is closely pro- 
portional to T,/T, and also to rz . On the other hand J2 is closely propor- 
._ tional to T, and essentially independent of T1/Ir',. Thus, the ratio of 
I,/I,; which determines y depends primarily on Ti/T.. Fig. 11 is a plot 
of the constant J./Z, contours in the T;/T, — X plane for [,/J in the 
range from —1.0 to —0.01. The graph is for T, = 10°. Since from (3.2) 


4 
oo ee (4.4) 
Dali 
for an n-p-n transistor, and assuming D,/D V4 for silicon, then 
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Fig. 10 — Collector junction depth (in reduced units) as a function of the sur- 
face concentration (in reduced units) of the diffusant which determines the con- 
ductivity type of the base layer (f'2) and the ratio of the diffusion lengths (A) of 
the two diffusing impurities. 


DIFFUSED EMITTER AND BASE SILICON TRANSISTORS 17 





Fig. 11 — Dependence of emitter efficiency upon diffusant surface concentra- 
tions and diffusion lengths. The lines of constant J:/J, are essentially lines of 
constant emitter efficiency. The ordinate is the ratio of surface concentrations of 
the two diffusants and the abscissa is the ratio of their diffusion lengths. 


I,/I, = —1.0 corresponds to a y of 0.75 and I2/I, = —0.01 corresponds 
to avy of 0.997. 


4.3 Base Resistance 


It was indicated above that J, depends principally on TY; and X. Fig. 12 
is a plot of the constant J, contours in the I, — d plane for J, in the range 
from —10° to —10. The graph is for T/T, = 10. The base layer sheet 
conductivity, g, , can be calculated from these data as 


go = —QpleliN; (4.5) 


where q, I, and N; are as defined above and ji is a mobility properly 
weighted to account for impurity scattering in the non-uniformly doped 
base region. The units of g» are mhos per square. 


18 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 





Fig. 12 — Dependence of base layer sheet conductivity on diffusant surface 
concentrations and diffusion lengths. The lines of constant J: are essentially lines 
of constant base sheet conductivity. The ordinate is the surface concentration 
(in reduced units) of the diffusant which determines the conductivity type of the 
base layer and the abscissa is the ratio of the diffusion lengths of the two Oditusing 
impurities. 


44 Transit Time 


With a knowledge of where the minimum value, £’, of (4.8) occurs, 
it is possible to calculate over what fraction of the base width the fields 
are retarding. The interesting quantity here is 





£ — 2a 
AR = Bee | 
¢’ is a function of T:/T'2 and \ and varies only very slowly with T/T. 
a is also a function of T,/T, and \ and varies only slowly with T1/T2. 
The most rapidly changing part of AR is 8 which depends primarily on 
T2 as noted above. Fig. 13 is a plot of the constant AR contours in the 
IT. — Xd plane for values of AR in the range 0.1 to 0.3. This graph is 
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for data with Ti1/T2 = 10. As T:/I’2 increases at constant I’, and A, AR 
decreases slightly. At [1/T'2 = 10°, the average change in AR is a decrease 
of about 25 per cent for constant Iz and \ when AR S 0.3. The error is 
larger for values of AR greater than 0.3, It was noted above that when 
AR becomes greater than 0.3, the retarding fields become dominant. 
Therefore, this region is of slight interest in the design of a high frequency 
transistor. 

4.5 A Sample Design 


By superimposing Figs. 11, 12 and 13 the ranges of I’. , T1/I'2 and d 
which are consistent with desired values of y, g, and AR can be deter- 
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Fig. 13 — Dependence of the built-in field distribution on concentrations and 
diffusion lengths. The lines of constant AR indicate the fraction of the base layer 
thickness over which built-in fields are retarding. The ordinate is the surface 
concentration (in reduced units) of the diffusant which determines the conductiv- 
ity type of the base layer and the abscissa is the ratio of the diffusion lengths of 
the two diffusing impurities. 
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mined by the area enclosed by the specified contour lines. It is also 
possible to compare the measured parameters of a specific device and 
observe how closely they agree with what is predicted from the estimated 
concentrations and diffusion coefficients. This is done below for the 
transistor described in Sections 1 and 2. 

The comparison is complicated by the fact that the exact values of the 
surface concentrations and diffusion coefficients are not known precisely 
enough at present to permit an accurate evaluation of the design theory. 
However, the following values of concentrations and diffusion coefficients 
are thought to be realistic for this transistor.’ 


Ni=5X10" Di=3 X10” ty 
Ne=4>X10" D.,=25%X10" ft 
Nz = 10° 

From these values it is seen that 

| T/T, = 12.5; T2=400; 2»=06 


From Fig. 9,a = 1.9 and from Fig. 10, 8 = 3.6 and therefore w = 1.7. 
Measurement of the emitter and base layer dimensions showed that these 
layers were approximately the same thickness which was 3.8 X 10° cm. 
Thus the measured ratio of emitter width to base width of unity is in 
good agreement with the value of 1.1 predicted from the assumed con- 
centrations and diffusion coefficients. 

From Fig. 11, I2/l, = —0.01. If this value is substituted into (4:4), 
y = 0.997. This compares with a measured maximum alpha of 0.972. 

From Fig. 12, 7; = —15. Assuming an average hole mobility of 350 
cm’/volt. sec. and evaluating ZL: from the measured emitter thickness | 
and the calculated a, (4.5) gives a value of g, = 1.7 X 10° mhos per 
square. The geometry of the emitter and base contacts as shown in Fig. 
3 makes it difficult to calculate the effective base resistance from the 
sheet conductivity even at very small emitter currents. In addition at 
the very high injection levels at which these transistors are operated the 
calculation of effective base resistance becomes very difficult. However, 
from the geometry it would be expected that the effective base re- 
sistance would be no greater than 0.1 of the sheet resistivity or 600 ohms. 
This is about seven times larger than the measured value of 80 ohms 
reported in Section 2. 

From Fig. 18, AR is approximately 0.20. Thus there should be an over- 
all aiding effect of the built-in fields. In addition the impurity gradient 
at the emitter junction is believed to be approximately 10”/cm‘ and the 


5.7 < 10° 
1.2 x 10° 
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space charge associated with this gradient will extend approximately 

2 X 10°° em into the base region. The base thickness over which re- 

tarding fields extend is AR times the base width or 7.6 X 10° cm. Thus 

the first quarter of region R will be space charge and can be neglected. 
The frequency cutoff from pure diffusion transit is given by 


_ 24D 
Ia W? 


where W is the measured base layer thickness. Assuming D = 25 em’/sec 
for electrons in the base region, f. = 67 mc/sec. Since the measured 
cutoff was 120 mc/sec, the predicted aiding effect of the built-in field 
is evidently present. 

These computations illustrate how the measured electrical parameters 
can be used to check the values of the surface concentrations and dif- 
fusion coefficients. Conversely knowledge of the concentrations and 
diffusion coefficients aid in the design of devices which will have pre- 
scribed electrical parameters. The agreement in the case of the transistor 
described above is not perfect and indicates errors in the proposed values 
of the concentrations and diffusion coefficients. However, it is sufficiently 
close to be encouraging and indicate the value of the calculations. 

The discussion of design has been limited to a very few of the important 
parameters. Junction capacitances, emitter and collector resistances are 
among the other important characteristics which have been omitted 
here. Presumably all of these quantities can be calculated if the detailed 
structure of the device is known and the structure should be susceptible 
to the type of analysis used above. Another fact, which has been ignored, 
is that these transistors were operated at high injection levels and a low 
level analysis of electrical parameters was used. All of these other factors 
must be considered for a detailed understanding of the device. The object 
of this last section has been to indicate one path which the more detailed 
analysis might take. 


Sa (4.6) 


5.0 CONCLUSIONS 


By means of multiple diffusion, it has been possible to produce silicon 
transistors with alpha-cutoff above 100 mc/sec. Refinements of the 
described techniques offer the possibility of even higher frequency per- 
formance. These transistors show the other advantages expected from 
silicon such as low saturation currents and satisfactory operation at 
high temperatures. 

The structure of the double diffused transistor 1s susceptible to design 
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analysis in a fashion similar to that which has been applied to other junc- 
tion transistors. The non-uniform distribution of impurities produces 
significant electrical effects which can be controlled to enhance appre- 
ciably the high-frequency behavior of the devices. 

The extreme control inherent in the use of diffusion to distribute im- 
purities in a semiconductor structure suggests that this technique will 
become one of the most valuable in the fabrication of semiconductor 
devices. 
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A High-Frequency Diffused Base 


Germanium Transistor 


By CHARLES A. LEE 
(Manuscript received November 15, 1955) 


Techniques of impurity diffusion and alloying have been developed which 
make possible the construction of p-n-p junction transistors utilizing a 
diffused surface layer as a base region. An important jeature is the high 
degree of dimensional control obtainable. Diffusion has the advantages of 
being able to produce uniform large area junctions which may be utilized in 
high power devices, and very thin surface layers which may be utilized in 
high-frequency devices. 

Transistors have been made in germanium which typically have alphas 
of 0.98 and alpha-cutoff frequencies of 500 mc/s. The fabrication, electrical 
characterization, and design considerations of these transistors are dis- 
cussed. 


INTRODUCTION 


Recent work *” concerning diffusion of impurities into germanium 
and silicon prompted the suggestion’ that the dimensional control in- 
herent in these processes be utilized to make high-frequency transistors. 

One of the critical dimensions of junction transistors, which in many 
cases seriously restricts their upper frequency limit of operation, is the 
thickness of the base region. A considerable advance in transistor proper- 
ties can be accomplished if it is possible to reduce this dimension one or 
two orders of magnitude. The diffusion constants of ordinary donors 
and acceptors in germanium are such that, withn realizable tempera- 
tures and times, the depth of diffused surface layers may be as small as 
10° em. Already in the present works layers slightly less than 1 micron 
(10-* em) thick have been made and utilized in transistors. Moreover, 
the times and temperatures required to produce 1 micron surface layers 
permit good control of the depth of penetration and the concentration 
of the diffusant in the surface layer with techniques described below. 

If one considers making a transistor whose base region consists of such 
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a diffused surface layer, several problems become immediately apparent: 

(1) Control of body resistivity and lifetime during the diffusion heat- 
ing cycle. 

(2) Control of the surface concentration of the diffusant. 

(3) Making an emitter on the surface of a thin diffused layer and 
controlling the depth of penetration. 

(4) Making an ohmic base contact to the diffused surface layer. 
One approach to the solution of these problems in germanium which has 
enabled us to make transistors with alpha-cutoff frequencies in excess 
of 500 me/sec is described in the main body of the paper. 

An important characteristic feature of the diffusion technique is that 
it produces an impurity gradient in the base region of the transistor. 
This impurity gradiant produces a “built-in” electric field in such a 
direction as to aid the transport of minority carriers from emitter to 
collector. Such a drift field may considerably enhance the frequency 
response of a transistor for given physical dimensions.’ 

The capabilities of these new techniques are only partially realized 
by their application to the making of high frequency transistors, and 
even in this field their potential has not been completely explored. For 
example, with these techniques applied to making a p-n-i-p structure 
the possibility of constructing transistor amplifiers with usable gain at 
frequencies in excess of 1,000 mc/sec now seems feasible. 


DESCRIPTION OF TRANSISTOR FABRICATION AND PHYSICAL CHARACTERIS- 
TICS 


As starting material for a p-n-p structure, p-type germanium of 0.8 
ohm-cm resistivity was used. From the single crystal ingot rectangular 
bars were cut and then lapped and polished to the approximate dimen- 
sions: 200 X 60 X 15 mils. After a slight etch, the bars were washed in 
deionized water and placed in a vacuum oven for the diffusion of an 
n-type impurity into the surface. The vacuum oven consisted of a small 
molybdenum capsule heated by radiation from a tungsten coil and sur- 
rounded by suitable radiation shields made also of molybdenum. The 
capsule could be baked out at about 1,900°C in order that impurities 
detrimental to the electrical characteristics of the germanium be evapo- 
rated to sufficiently low levels.” 

As a source of n-type impurity to be placed with the p-type bars in 
the molybdenum oven, arsenic doped germanium was used. The rela- 
tively high vapor pressure of the arsenic was reduced to a desirable range 
(about 10~* mm of Hg) by diluting it in germanium. The use of ger- 
manium eliminated any additional problems of contamination by the 
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dilutant, and provided a convenient means of determining the degree of 
dilution by a measurement of the conductivity. The arsenic concentra- 
tions used in the source crystal were typically of the order of 10°-10'°/ce. 
These concentrations were rather high compared to the concentrations 
desired in the diffused surface layers since compensation had to be made 
for losses of arsenic due to the imperfect fit of the cover on the capsule 
and due to some chemical reaction and adsorption which occurred on the 
internal surfaces of the capsule. 

The layers obtained after diffusion were then evaluated for sheet con- 
ductivity and thickness. To measure the sheet conductivity a four-point 
probe method’ was used. An island of the surface layer was formed by 
masking and etching to reveal the junction between the surface layer 
and the p-type body. The island was then biased in the reverse direction 
with respect to the body thus effectively isolating it electrically during 
the measurement of its sheet conductivity. The thickness of the surface 
layer was obtained by first lapping at a small angle to the original surface 
(16°-1°) and locating the junction on the beveled surface with a thermal 
probe; then multiplying the tangent of the angle between the two sur- 
faces by the distance from the edge of the bevel to the junction gives the 
desired thickness. Another particularly convenient method of measuring 
the thickness’ is to place a half silvered mirror parallel to the original sur- 
face and count fringes, of the sodium D-line for example, from the edge 
of the bevel to the junction. Typically the transistors described here 
were prepared from diffused layers with a sheet conductivity of about 
200 ohms/square, and a layer thickness of (1.5 + 0.3) X 10° cm. 

When the surface layer had been evaluated, the emitter and base con- 
tacts were made using techniques of vacuum evaporation and alloying. 
For the emitter, a film of aluminum approximately 1,000 A thick was 
evaporated onto the surface through a mask which defined an emitter 
area of 1 X 2 mils. The bar with the evaporated aluminum was then 
placed on a strip heater in a hydrogen atmosphere and momentarily 
brought up to a temperature sufficient to alloy the aluminum. The 
emitter having been thus formed, the bar was again placed in the masking 
jig and a film of gold-antimony alloy from 3,000 to 4,000 A thick was 
evaporated onto the surface. This film was identical in area to the 
emitter, and was placed parallel to and 0.5 to 1 mil away from the 
emitter. The bar was again placed on the heater strip and heated to the 
gold-germanium eutectic temperature, thus forming the ohmic base 
contact. The masking jig was constructed to permit the simultaneous 
evaporation of eight pairs of contacts on each bar. Thus, using a 3-mil 
diamond saw, a bar could be cut into eight units. 
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Each unit, with an alloyed emitter and base contact, was then soldered 
to a platinum tab with indium, a sufficient quantity of indium being 
used to alloy through the n-type surface layer on the back of the unit. 
One of the last steps was to mask the emitter and base contacts with a 
6- to 8-mil diameter dot of wax and form a small area collector junction 
by etching the unit attached to the platinum tab, in CP4. After washing 
in solvents to remove the wax, the unit was mounted in a header designed 
to allow electrolytically pointed wire contacts to be made to the base and 
emitter areas of the transistor. These spring contacts were made of 1-mil 
phosphor bronze wire. 


ELECTRICAL CHARACTERIZATION 


Of the parameters that characterize the performance of a transistor, 
one of the most important is the short circuit current gain (alpha) ver- 
sus frequency. The measured variation of a and a/({1 — a) (short-circuit 
current gain in the grounded emitter circuit) as a function of frequency 
for a typical unit is shown in Fig. 1. For comparison the same parameters 
for an exceptionally good unit are shown in Fig. 2. 

In order that the alpha-cutoff frequency be a measure of the transit 
time of minority carriers through the active regions of the transistor, any 
resistance-capacity cutoffs, of the emitter and collector circuits, must lie 
considerably higher than the measured f,. In the emitter circuit, an 
external contact resistance to the aluminum emitter of the order of 10 
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Fig. 1 — The grounded emitter and grounded base response versus frequency 
for a typical unit. 
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Fig. 2— The grounded emitter and grounded base response versus frequency 
for an exceptionally good unit. 


to 20 ohms and a junction transition capacity of 1 uyufd were measured. 
The displacement current which flows through this transition capacity 
reduces the emitter efficiency and must be kept small relative to the 
injected hole current. With 1 milliampere of current flowing through the 
emitter junction, and consequently an emitter resistance of 26 ohms, 
the emitter cutoff for this transistor was above 6,000 mc/sec. One can 
now see that the emitter area must be small and the current density 
high to attain a high emitter cutoff frequency. The fact that a low base 
resistance requires a high level of doping in the base region, and thus a 
high emitter transition capacity, restricts one to small areas and high 
current densities. 

In the collector circuit capacities of 0.5 to 0.8 wufd at a collector volt- 
age of —10 volts were measured. There was a spreading resistance in the 
collector body of about 100 ohms which was the result of the small 
emitter area. The base resistance was approximately 100 ohms. If the 
phase shift and attenuation due to the transport of minority carriers 
through the base region were small at the collector cutoff frequency, the 
effective base resistance would be decreased by the factor (1 — a). The 
collector cutoff frequency is then given by 


1 
fo = 2rC Rk, 


where C, = collector transition capacity 


and f, = collector body spreading resistance. 
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However, in the transistors described here the base region produces the 
major contribution to the observed alpha-cutoff frequency and it is more 
appropriate to use the expression 


1 


Je = OnC(rs + Re) 


where 7, = base resistance. This cutoff frequency could be raised by in- 
creasing the collector voltage, but the allowable power dissipation in the 
mounting determines an upper limit for this voltage. It should b noted 
that an increase in the doping of the collector material would raise the 
cutoff since the spreading resistance is inversely proportional to N, , 
while the junction capacity for constant collector voltage is only pro- 
portional to N,"”. 

The low-frequency alpha of the transistor ranged from 0.95 to 0.99 
with some exceptional units as high as 0.998. The factors to be con- 
sidered here are the emitter efficiency y and the transport factor ~. 
The transport factor is dependent upon the lifetime in the base region, 
the recombination velocity at the surface immediately surrounding the 
emitter, and the geometry. The geometrical factor of the ratio of the 
emitter dimensions to the base layer thickness is >10, indicating that 
solutions for a planar geometry may be assumed.’ If a lifetime in the base 
region of 1 microsecond and a surface recombination velocity of 2,000 
cm/sec is assumed a perturbation calculation’ gives 


B = 0.995 


The high value of 6 obtained with what is estimated to be a low base 
region lifetime and a high surface recombination velocity indicates that 
the observed low frequency alpha is most probably limited by the 
emitter injection efficiency. As for the emitter injection efficiency, within 
the accuracy to which the impurity concentrations in the emitter re- 
growth layer and the base region are known, together with the thick- 
nesses of these two regions, the calculated efficiency is consistent with 
the experimentally observed values. 


CONSIDERATIONS OF TRANSIT TIME 


An examination of what agreement exists between the alpha-cutoff 
frequency and the physical measurements of the base region involves 
the mechanism of transport of minority carriers through the active 
regions of the transistor. The ‘‘active regions” include the space charge 
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region of the collector junction. The transit time through this region” 
is no longer a negligible factor. A short calculation will show that with 
—10 volts on the collector junction, the space charger layer is about 
4 <X 10 em thick and that the frequency cutoff associated with trans- 
port through this region is approximately 3,000 mc/sec. 

The remaining problem is the transport of minority carriers through 
the base region. Depending upon the boundary conditions existing at the 
surface of the germanium during the diffusion process, considerable 
gradients of the impurity density in the surface layer are possible. How- 
ever, the problem of what boundary conditions existed during the diffu- 
sion process employed in the fabrication of these transistors will not be 
discussed here because of the many uncertainties involved. Some quali- 
tative idea is necessary though of how electric fields arising from impurity 
gradients may affect the frequency behavior of a transistor in the limit 
of low injection. 

If one assumes a constant electric field as would result from an ex- 
ponential impurity gradient in the base region of a transistor, then the 
continuity equation may be solved for the distribution of minority 
carriers.’ From the hole distribution one can obtain an expression for 
the transport factor 8 and it has the form 


Z 


8 = e* —____________.. 
n sinh Z + Z cosh Z 
where 
> ONG EE? 


N. = donor density in base region at emitter junction 


N,. = donor density in base region at collector junction 


E = electric field strength 
D, = diffusion constant for holes 


w = width of the base layer 


| 
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A plot of this function for various values of 7 is shown in Fig. 3. For n = 0, 
the above expression reduces to the well known case of a uniformly doped 
base region. The important feature to be noted in Fig. 3 is that relatively 
small gradients of the impurity distribution in the base layer can produce 
a considerable enhancement of the frequency response. 

It is instructive to calculate what the alpha-cutoff frequency would be 
for a base region with a uniform distribution of impurity. The effective 
thickness of the base layer may be estimated by decreasing the measured 
thickness of the surface layer by the penetration of the space charge 
region of the collector and the depth of the alloyed emitter structure. 
Using a value for the diffusion constant of holes in the base region appro- 
priate to a donor density of about 10”/ce, 


300 me/s S fa S 800 mc/s 


This result implies that the frequency enhancement due to ‘‘built-in” 
fields is at most a factor of two. In addition it was observed that the 
alpha-cutoff frequency was a function of the emitter current as shown 
in Fig. 4. This variation indicates that at least intermediate injection 
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Fig. 3 — The variation of | 8 | versus frequency for various values of a uniform 
drift field i in the base region. 
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Fig. 4 — The variation of the alpha-cutoff frequency as a function of emitter 
current. 


levels exist in the range of emitter current shown in Fig. 4. The conclu- 
sion to be drawn then is that electric fields produced by impurity 
gradients in the base region are not the dominant factor in the transport 
of minority carriers in these transistors. 

The emitter current for a low level of injection could not be deter- 
mined by measuring f, versus J, because the high input impedance at 
very low levels was shorted by the input capacity of the header and 
socket. Thus at very small emitter currents the measured cutoff fre- 
quency was due to an emitter cutoff and was roughly proportional to 
the emitter current. At J, = 1 ma this effect is small, but here at least 
intermediate levels of injection already exist. 

A further attempt to measure the effect of any “built-in” fields by 
turning the transistor around and measuring the inverse alpha proved 
fruitless for two reasons. The unfavorable geometrical factor of a large 
collector area an a small emitter area as well as a poor injection effi- 
ciency gave an alpha of only 


a= 0.1 


Secondly, the injection efficiency turns out in this case to be proportional 
tow” giving a cutoff frequency of less than 1 mc/sec. The square-root 
dependence of the injection efficiency on frequency may be readily seen. 
The electron current injected into the collector body may be expressed as 


ee lr 


J e= qD,N | Le 


where q = electronic charge 
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D, = diffusion constant of electrons 


kT 
v, = voltage across collector junction 
n- = density of electrons on the p-type side of the collector junction 
T. = lifetime of electrons in collector body 


L, = diffusion length of electrons in the collector body 
Since the inverse cutoff frequency is well below that associated with the 


base region, we may regard the injected hole current as independent of 
the frequency in this region. The injection efficiency is low so that 


y= iB K1 
Thus at a frequency where 
Te > 1 
then 
yo agaie 


An interesting feature of these transistors was the very high current 
densities at which the emitter could be operated without appreciable loss 
of injection efficiency. Fig. 5 shows the transmission of a 50 millimicro- 
second pulse up to currents of 18 milliamperes which corresponds to a 
current density of 1800 amperes/cm*. The injection efficiency should 
remain high as long as the electron density at the emitter edge of the 
base region remains small compared to the acceptor density in the 
emitter regrowth layer. When high injection levels are reached the in- 
jected hole density at the emitter greatly exceeds the donor density in the 
base region. In order to preserve charge neutrality then 


pen 
where p = hole density 
n = electron density 


As the injected hole density is raised still further the electron density 
will eventually become comparable to the acceptor density in the 
emitter regrowth layer. The density of acceptors in the emitter regrowth 
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Fig. 5 — Transmission of a 50 millimicrosecond pulse at emitter currents up 
to 18 ma by a typical unit. (Courtesy of F. K. Bowers). 
region is of the order of 
Na & 10°°/ce 
and this is to be compared with injected hole density at the base region 


side of the emitter junction. The relation between the injected hole 
density and the current density may be approximated by* 


Jp = 2qD pp 
W 


where p, = hole density at emitter side of base region 
w = width of base region 


A short calculation indicates that the emitter efficiency should remain 
high at a current density of an order of magnitude higher than 1,800 
amp/cm’. The measurements were not carried to higher current densities 
because the voltage drop across the spreading resistance in the collector 
was producing saturation of the collector junction. 


CONCLUSIONS 


Impurity diffusion is an extremely powerful tool for the fabrication 
of high frequency transistors. Moreover, of the 50-odd transistors which 
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were made in the laboratory, the characteristics were remarkably uni- 
form considering the variations usually encountered at such a stage of 
development. It appears that diffusion process is sufficiently controllable 
that the thickness of the base region can be reduced to half that of the 
units described here. Therefore, with no change in the other design 
parameters, outside of perhaps a different mounting, units with a 1000 
me/s cutoff frequency should be possible. 
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ROOM DOB OO 


Waveguide Investigations with 
Millimicrosecond Pulses 


By A. C. BECK 
(Manuscript received October 11, 1955) 


Pulse techniques have been used for many waveguide testing purposes. 
The importance of increased resolution by means of short pulses has led to 
the development of equipment to generate, receive and display pulses about 
5 or 6 millimicroseconds long. The equipment is briefly described and its 
resolution and measuring range are discussed. Dominant mode waveguide 
and antenna tests are described and illustrated. Applications to multimode 
waveguides are then considered. Mode separation, delay distortion and its 
equalization, and mode conversion are discussed, and examples are given. 
The resolution obtained with this equipment provides information that is 
difficult to get by any other means, and its use has proved to be very helpful 
in waveguide investigations. 
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1. INTRODUCTION 


Pulse testing techniques have been employed to advantage in wave- 
guide investigations in numerous ways. The importance of better resolu- 
tion through the use of short pulses has always been apparent and, from 
the first, equipment was employed which used as short a pulse as pos- 
sible. Radar-type apparatus using magnetrons and a pulse width of 
about one-tenth microsecond has seen considerable use in waveguide 
research, and many of the results have been published.! 2 


30 
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To improve the resolution, work was initiated some time ago by S. E. 
Miller to obtain measuring equipment which would operate with much 
shorter pulses. As a result, pulses about 5 or 6 millimicroseconds long 
became available at a frequency of 9,000 mc. In a pulse of this length 
there are less than 100 cycles of radio frequency energy, and the signal 
occupies less than ten feet of path length in the transmission medium. 
The RF bandwidth required is about 500 me. In order to obtain such 
bandwidths, traveling wave tubes were developed by J. R. Pierce and 
members of the Electronics Research Department of the Laboratories. 
The completed amplifiers were designed by W. W. Mumford. N. J. 
Pierce, R. W. Dawson and J. W. Bell assisted in the design and construc- 
tion phases, and G. D. Mandeville has been closely associated in all of 
this work. 


2. PULSE GENERATION 


These millimicrosecond pulses have been produced by two different 
types of generators. In the first equipment, a regenerative pulse gener- 
ator of the type suggested by C. C. Cutler of the Laboratories was used.’ 
This was a very useful device, although somewhat complicated and hard 
to keep in adjustment. A brief description of it will permit comparisons 
with a simpler generator which was developed a little later. 

A block diagram of the regenerative pulse generator is shown in Fig. 1. 
The fundamental part of the system is the feedback loop drawn with 
heavy lines in the lower central part of the figure. This includes a travel- 
ing wave amplifier, a waveguide delay line about sixty feet long, a crystal 
expander, a band-pass filter, and an attenuator. This combination forms 
an oscillator which produces very short pulses of microwave energy. 
Between pulses, the expander makes the feedback loop loss too high for 
oscillation. Each time the pulse circulates around the loop it tends to 
shorten, due to the greater amplification of its narrower upper part 
caused by the expander action, until it uses the entire available band- 
width. A 500-me gaussian band-pass filter is used in the feedback loop 
of this generator to determine the final bandwidth. An automatic gain 
control operates with the expander to limit the pulse amplitude, thus 
preventing amplifier compression from reducing the available expansion. 

To get enough separation between outgoing pulses for reflected pulse 
measurements with waveguides, the repetition rate would need to be 
too low for a practical delay line length in the loop. ‘Therefore a 12.8-me 
fundamental rate was chosen, and a gated traveling wave tube amplifier 
was used to reduce it to a 100-ke rate at the output. This amplifier is 
kept in a cutoff condition for 127 pulses, and then a gate pulse restores 
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it to the normal amplifying condition for fifty millimicroseconds, during 
which time the 128th pulse is passed on to the output of the generator 
as shown on Fig. 1. 

The synchronizing system is also shown on Fig. 1. A 100-ke quartz 
crystal controlled oscillator with three cathode follower outputs is the 
basis of the system. One output goes through a seven stage multiplier 
to get a 12.8-mc signal, which is used to control a pulser for synchroniz- 
ing the circulating loop. Another output controls the gate pulser for the 
output traveling wave amplifier. Accurate timing of the gate pulse is 
obtained by adding the 12.8-me pulses through a buffer amplifier to the 
gate pulser. The third output synchronizes the indicator oscilloscope 
sweep to give a steady pattern on the screen. 

Although this equipment was fairly satisfactory and served for many 
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Fic. 1 — Block diagram of the regenerative pulse generator. 
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testing purposes, it was rather complex and there were some problems 
in its construction and use. It was difficult to obtain suitable microwave 
crystals to match the waveguide at low levels in the expander. This 
would make it even more difficult to build this type of pulse generator 
for higher frequency ranges. Stability also proved to be a problem. The 
frequency multiplier had to be very well constructed to avoid phase 
shift due to drifting. The gate pulser also required care in design and 
construction in order to get a stable and flat output pulse. It was some- 
what troublesome to keep the gain adjusted for proper operation, and 
the gate pulse time adjustment required some attention. The pulse 
frequency could not be changed. For these reasons, and in order to get 
a smaller, lighter and less complicated pulse generator, work was carried 
out to produce pulses of about the same length by a simpler method. 

If the gated output amplifier of Fig. 1 were to have a CW instead of a 
pulsed input, a pulse of microwave energy would nevertheless appear at 
the output because of the presence of the gating pulse. This gating pulse 
is applied to the beam forming electrode of the tube to obtain the gating 
action. If the beam forming electrode could be pulsed from cutoff to its 
normal operating potential for a very short time, very short pulses of 
output energy could be obtained from a continuous input signal. How- 
ever, it is difficult to obtain millimicrosecond video gating pulses of suf- 
ficient amplitude for this purpose at a 100-ke repetition rate. 

A traveling-wave tube amplifies normally only. when the helix is 
within a small voltage range around its rated de operating value. For 
voltages either above or below this range, the tube is cut off. When the 
helix voltage is raised through this range into the cutoff region beyond 
it, and then brought back again, two pulses are obtained, one during a 
small part of the rise time and the other during a small part of the return 
time. If the rise and fall times are steep, very short pulses can be 
obtained. Fig. 2 shows the pulse envelopes photographed from the 
indicator scope screen when this is done. For the top trace, the helix was 
biased 300 volts negatively from its normal operating potential, then 
pulsed to its correct operating range for about 80 millimicroseconds, 
during which time normal amplification of the CW input signal was ob- 
tained. The effect of further increasing the helix video pulse amplitude 
in the positive direction is shown by the succeeding lower traces. The 
envelope dips in the middle, then two separated pulses remain — one 
during a part of the rise time and one during a part of the fall time of 
helix voltage. The pulses shown on the bottom trace have shortened to 
about six millimicroseconds in length. The helix pulse had a positive 
-amplitude of about 500 volts for this trace. 
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Since only one of these pulses can be used to get the desired repetition 
rate, it is necessary to eliminate the other pulse. This is done in a simi- 
lar manner to that used for gating out the undesired pulses in the re- 
generative pulse generator. However, it is not necessary to use another 
amplifier, as was required there, since the same tube can be used for 
this purpose, as well as for producing the microwave pulses. Its beam 
forming electrode is biased negatively about 250 volts with respect to 
the cathode, and then is pulsed to the normal operating potential for 
about 50 millimicroseconds during the time of the first short pulse ob- 
tained by gating the helix. Thus, the beam forming electrode potential 
has been returned to the cutoff value during the second helix pulse, 
which is therefore eliminated. 

A block diagram of the resulting double-gated pulse generator is 
shown in Fig. 3. Comparison with Fig. 1 shows that it is simpler 
than the regenerative pulse generator, and it has also proved more 
satisfactory in operation. It can be used at any frequency where a sig- 
nal source and a traveling-wave amplifier are available, and the pulse 





Fig. 2 — Envelopes of microwave pulses at the output of a traveling wave am- 
plifier with continuous wave input and helix gating. The gating voltage is higher 
for the lower traces. 
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frequency can be set anywhere within the bandwidth of the traveling- 
wave amplifier by tuning the klystron oscillator. 

The pulse center frequency is shifted from that of the klystron os- 
cillator frequency by this helix gating process. An over-simplified but 
helpful explanation of this effect can be obtained by considering that 
the microwave signal voltage on the helix causes a bunching of the elec- 
tron stream. This bunching has the same periodicity as the microwave 
signal voltage when the dc helix potential is held constant. However, 
since the helix voltage is continuously increased in the positive direction 
during the time of the first pulse, the average velocity of the last bunches 
of electrons becomes higher than that of the earlier bunches in the pulse, 
because the later electrons come along at the time of higher positive 
helix voltage. This tends to shorten the total length of the series of 
bunches, resulting in a shorter wavelength at the output end of the 
helix and therefore a higher output microwave frequency. On the second 
pulse, obtained when the helix voltage returns toward zero, the process 
is reversed, the bunching is stretched out, and the frequency is de- 
creased. This second pulse is, however, gated out in this arrangement 
by the beam-forming electrode pulsing voltage. The result for this 
particular tube and pulse length is an effective output frequency ap- 
proximately 150 mc higher than the oscillator frequency, but this figure 
is not constant over the range of pulse frequencies available within the 
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Fig. 3 — Block diagram of the double-gated traveling wave tube millimicro- 
second pulse generator. 
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3. RECEIVER AND INDICATOR 


The receiving equipment is shown in Fig. 4. It uses two traveling- 
wave amplifiers in cascade. A wide band detector and a video amplifier 
then follow, and the signal envelope is displayed by connecting it to 
the vertical deflecting plates of a 5 XP type oscilloscope tube. The 
video amplifier now consists of two Hewlett Packard wide band dis- 
tributed amplifiers, having a baseband width of about 175 mc. The 
second one of these has been modified to give a higher output voltage. 
The sweep circuits for this oscilloscope have been built especially for 
this use, and produce a sweep speed in the order of 6 feet per micro- 
second. An intensity pulser is used to eliminate the return trace. These 
parts of the system are controlled by a synchronizing output from the 
pulse generator 100-ke oscillator. A precision phase shifter is used at 
the receiver for the same purpose that a range unit is employed in radar 
systems. This has a dial, calibrated in millimicroseconds, which moves 
the position of a pulse appearing on the scope and makes accurate 
measurement of pulse delay time possible. 

Fig. 4 also shows the appearance of the pulses obtained with this 
equipment. The pulse on the left-hand side of this trace came from the 
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Fig. 4 — Block diagram of millimicrosecond pulse receiver and indicator. The 
indicator trace photograph shows pulses from each type of generator. 
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newer double-gated pulse generator, while the pulse on the right was 
produced by the regenerative pulse generator. It can be seen that they 
appear to have about the same pulse width and shape. This is partly 
due to the fact that the video amplifier bandwidth is not quite adequate 
to show the actual shape, since in both cases the pulses are slightly 
shorter than can be correctly reproduced through this amplifier. The 
ripples on the base line following the pulses are also due to the video 
amplifier characteristics when used with such short pulses. 


4, RESOLUTION AND MEASURING RANGE 


Fig. 5 shows a piece of equipment which was placed between the pulse 
generator and the receiver to show the resolution which can be obtained. 
This waveguide hybrid junction has its branch marked 1 connected to 
the pulse generator and branch 3 connected to the receiver. If the two 
side branches marked 2 and 4 were terminated, substantially no energy 
would be transmitted from the pulser straight through to the receiver. 
However, a short circuit placed on either side branch will send energy 
through the system to the receiver. Two short circuits were so placed 
that the one on branch 4 was 4 feet farther away from the hybrid junc- 
tion than the one on branch 2. The pulse appearing first is produced by 
a signal traveling from the pulse generator to the short circuit on branch 
2 and then through to the receiver, as shown by the path drawn with 
short dashes. A second pulse is produced by the signal which travels 
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Fig. 5 — Waveguide hybrid circuit used to demonstrate resolution of milli- 
microsecond pulses, Trace photographs of pulses from each type of generator are 
shown, 
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TEG IN 3” DIAM COPPER GUIDE (150 FT LONG) 


Fig. 6 — Waveguide arrangement and oscilloscope trace photos showing pres- 
ence and location of defective joint. The dominant mode (TI);) was used with its 
polarization changed 90 degrees for the two trace photos. 


from the pulse generator through branch 4 to the short circuit and then 
to the receiver as shown by the long dashed line. This pulse has traveled 
8 feet farther in the waveguide than the first pulse. This would be equiva- 
lent to seeing separate radar echoes from two targets about 4 feet apart. 
Resolution tests made in this way with the pulses from the regenerative 
pulse generator, and from the double-gated pulse generator, are shown 
on Fig. 5. With our video amplifier and viewing equipment, there is 
no appreciable difference in the resolution obtained using either type 
of pulse generator. 

The measuring range is determined by the power output of the gated 
amplifier at saturation and by the noise figure of the first tube in the 
receiver. In this equipment the saturation level is about 1 watt, and the 
noise figure of the first receiver tube is rather poor. As a result, received 
pulses about 70 db below the outgoing pulse can be observed, which is 
enough range for many measurement purposes. 


5. DOMINANT MODE WAVEGUIDE TESTS 


Fig. 6 shows the use of this equipment to test 3” round waveguides 
such as those installed between radio repeater equipment and an an- 
tenna. This particular 150-foot line had very good soldered joints and was 
thought to be electrically very smooth. The signal is sent in through a 
transducer to produce the dominant TE; mode. The receiver is con- 
nected through a directional coupler on the sending end to look for any 
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¥ig. 7 — Defective joint caused by imperfect soldering which gave the reflec- 
tion shown on Fig. 6. 


reflections from imperfections in the line. The overloaded signal at the 
left of the oscilloscope trace is produced by leakage directly through 
the directional coupler. The overloaded signal on the other end of this 
trace is produced by the reflection from the short circuit piston at the 
far end of the waveguide. The signal between these two, which is about 
45 db down from the input signal, is produced by an imperfect joint 
in the waveguide. The signal polarization was oriented so that a maxi- 
mum reflection was obtained in the case of the lower trace. In the 
other trace, the polarization was changed by 90°. It is seen that this 
particular joint produces a stronger reflection for one polarization than 
for the other. By use of the precision phase shifter in the receiver the 
exact location of this defect was found and the particular joint that was 
at fault was sawed out. Fig. 7 shows this joint after the pipe had been 
cut in half through the middle. The guide is quite smooth on the inside 
in spite of the discoloration of some solder that is shown here, but on 
the left-hand side of the illustration the open crack is seen where the 
solder did not run in properly. This causes the reflected pulse that shows 
on the trace. The fact that this crack is less than a semi-circumference 
in length causes the echo to be stronger for one polarization than for the 
other. 
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Fig. 8 shows the same test for a 3’’ diameter aluminum waveguide 
250 feet long. This line was mounted horizontally in the test building 
with compression couplings used at the joints. The line expanded on 
warm days but the friction of the mounting supports was so great that 
it pulled open at some of the joints when the temperature returned to 
normal. These open joints produced reflected pulses from 40 to 50 db 
down, which are shown here. They come at intervals equal to the length 
of one section of pipe, about 12 feet. Some of these show polarization 
effects where the crack was more open on one side than on the other, 
but others are almost independent of polarization. These two photo- 
graphs of the trace were taken with the polarization changed 90°. 

Fig. 9 shows the same test for a 3” diameter galvanized iron wave- 
guide. This line had shown fairly high loss using CW for measure- 
ments. The existence of a great many echoes from random distances 
indicates a rough interior finish in the waveguide. Fig. 10 shows the 
kind of inperfections in the zine coating used for galvanizing which 
caused these reflections. 


6. TESTING ANTENNA INSTALLATIONS 


The use of this equipment in testing waveguide and antenna installa- 
tions for microwave radio repeater systems is shown in Fig. 11. This 
particular work was done in cooperation with A. B. Crawford’s antenna 
research group at Holmdel, who designed the antenna system. A direc- 
tional coupler was used to observe energy reflections from the system ° 
under test. In this installation a 3’’ diameter round guide carrying the 
TEx mode was used to feed the antenna. Two different waveguide 
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Fig. 8 — Reflections from several defective joints in a dominant (TE,,) mode 
waveguide. The two trace photos are for polarizations differing by 90 degrees. 
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Fic, 9 — Multiple reflections from a dominant (TIE,,) mode waveguide with a 


rough inside surface. The two trace photos are for polarizations differing by 90 
degrees. 


joints are shown here. In addition, a study was being made of the re- 
turn loss of the transition piece at the throat of the antenna which 
connected the 3” waveguide to the square section of the horn. The 
waveguide sections are about 10 feet long. The overloaded pulse at the 
left on the traces is the leakage through the directional coupler. The 





Fig. 10 — Rough inside surface of a galvanized iron waveguide which produced 
the reflections shown on Fig. 9. 
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other echoes are associated with the parts of the system from which 
they came by the dashed lines and arrows on the figure. A clamped 
joint in the line gave the reflection shown next following the initial 
overloaded pulse. A well made threaded coupling in which the ends of 
the pipe butted squarely is seen to have a very much lower reflection, 
scarcely observable on this trace. Since there is always reflection from 
the mouth and upper reflector parts of this kind of antenna, it is not 
possible to measure a throat transition piece alone by conventional CW 
methods, as the total reflected power from the system is measured. 
Here, use of the resolution of this short pulse equipment completely 
separated the reflection of the transition piece from all other reflections 
and made a measurement of its performance possible. In this particular 
case, the reflection from the transition is more than 50 db down from 
the incident signal which represents very good design. As can be seen, 
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Fig. 11 — Waveguide and antenna arrangement with trace photos showing re- 
flections from joints, transition section, and cover. 
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the reflection from the parabolic reflector and mouth is also quite low, 
and this characterizes a good antenna installation. 

The extra reflected pulse on the right of the lower trace on Fig. 11 
appeared when a fiberglas weatherproof cover was installed over the 
open mouth of the horn. This cover by itself would normally produce a 
troublesome reflection. However, in this antenna, it is a continuation of 
one of the side walls of the horn. Consequently, outgoing signals strike 
it at an oblique angle. Reflected energy from it is not focused by the 
parabolic section back at the waveguide, so the overall reflected power 
in the waveguide was found to be rather low. However, measuring it 
with this equipment, we found that an extra reflection appeared to 
come from a point 16 feet out in front of the mouth of the horn when the 
cover was in place. This is accounted for by the fact that energy re- 
flected obliquely from this cover bounces back and forth inside the 
horn before getting back into the waveguide, thus traveling the extra 
distance that makes the measurement seem to show that it comes from 
16 feet out in front. 


7. SEPARATION OF MODES ON A TIME BASIS 


If a pulse of energy is introduced into a moderate length of round 
waveguide to excite a number of modes which travel with different 
group velocities, and then observed farther along the line, or reflected 
from a piston at the end and observed at the beginning, separate pulses 
will be seen corresponding to each mode that is sent. This is illustrated 
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Fig. 12 — Arrangement for showing mode separation on a time basis in a multi- 
mode waveguide. The pulses in the trace photo have all traveled to the piston and 
back. The earlier outgoing pulse due to directional coupler unbalance is not shown. 
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in Fig. 12. In this arrangement energy was sent into the round line from 
a probe inserted in the side of the guide. This couples to all of the 12 
modes which can be supported, with the exception of the TE: circular 
electric mode. The sending end of the round guide was terminated. A 
directional coupler is connected to the sending probe so that the return 
from the piston at the far end can be observed on the receiver. Because 
of the different time that each mode takes to travel one round trip in 
this waveguide, which was 258 feet long, separate pulses are seen for 
each mode. The pulses in this figure have been marked to show which 
mode is being received. 

The time of each pulse referred to the outgoing pulse was measured 
and found to check very well with the calculated time. The formula for 
the time of transit in the waveguide for any mode is: 


L 
™ 0.988224/1 — v2 


where 7’ = time in millimicroseconds 


T 


L = length of pulse travel in feet 
Unm = A/Ne 
nN 


operating wavelength in air 


Xe 


cutoff wavelength of guide for the-mode involved. 


TABLE I — CALCULATED AND MEASURED VALUE OF TIME FOR ONE 
Rounp TrIe 





Time in Millimicroseconds 











Mode Number Mode Designation 
Calculated Measured 

1 TEn 545 545 
2 TMa 561 561 
3 TEa 587 587 
4 TMy 634 634 
5 TEa 634 — 
6 TEs; 665 665 
7 TMa, 795 793 
8 Tha 835 838 
9 TEy. 838 — 

10 TM 890 890 

11 TMs; 1461 — 

12 TEs; 1519 — 
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The calculated and measured value of time for one round trip is given 
in Table I. 

In this experiment the operating wavelength was 3.35 centimeters. 
This was obtained by measurements based on group velocity in a num- 
ber of guides as well as information about the pulse generator com- 
ponents. It represents an effective wavelength giving correct time of 
travel. The pulse occupies such a wide bandwidth that a measurement 
of its wavelength is difficult by the usual means. 

The dashes in the measured column indicate that the mode was not 
excited by the probe or was too weak to measure. These modes do not 
appear on the oscilloscope trace photograph. 

The relative pulse heights can be calculated from a knowledge of the 
probe coupling factors and the line loss. The probe coupling factors as 
given by M. Aronoff in unpublished work are expressed by the following: 

For TEnm modes: 


n* Ngnm 
2 2 
oom —- 7 Ng 11 





P = 2.890 


For TM,» modes: 





P .= 1.195¢, —— 
: Nou Nonm 


where 
P = ratio of probe coupling power in mode nm to that in mode TH 
n = first index of mode being calculated 
Kum = Bessel function zero value for mode being calculated = ad/)- 
\ = wavelength in air 
\y = wavelength in the guide for the mode involved 
Ac = cutoff wavelength of guide for the mode involved 
e, = lforn =0 
e, = 2forn 40 
d = waveguide diameter 


Formulas for guide loss as given by 8. A. Schelkunoff on page 390 of 
his book Electromagnetic Waves for this case where the resistivity of the 
aluminum guide is 4.14 & 10-* ohms per cm cube are: 
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For TE,,.. modes: 


a = 3,805 Kae — 7 + nm | CL = Yam) 
For TM,,, modes: 
a = 3.805(1 — vam)? 


where: 
a = attenuation of this aluminum guide in db 
n = first index of mode being calculated 
Knm = Bessel function zero value for mode being calculated = zd/)- 


Unm = A/Xe 


\ = operating wavelength in air 
A. = cutoff wavelength of guide for the mode involved 
d = waveguide diameter | 


Table II gives the calculated probe coupling factor, line loss, and rela- 
tive pulse height for each mode. In the calculation of the latter, wave 
ellipticity and loss due to mode conversion were neglected, but the heat 
loss given by the preceding formulas has been increased 20 per cent for 
all modes, to take account of surface roughness. Relative pulse heights 
were obtained by subtracting the relative line loss from twice the rela- 
tive probe coupling factor. The relative line loss is the number in the 
table minus 2.33 db, the loss for the TE, mode. 

The actual pulse heights on the photo of the trace on Fig. 12 are in 
fair agreement with these calculated values. Differences are probably 
due to polarization rotation in the guide (wave ellipticity) and conver- 
sion to other modes, effects which were neglected in the calculations, 
and which are different for different modes. 

Calculated pulse heights with this guide length, except for modes 
near cutoff, vary less than the probe coupling factors, because line loss 
is high when tight probe coupling exists. This is to be expected, since 
both are the result of high fields near the guide walls. 

The table of round trip travel time shows that the TE and TEh. 
modes are separated by only three millimicroseconds after the round 
trip in this waveguide. They would not be resolved as separate pulses 
by this equipment. However, the table of calculated pulse heights shows 
that the TE, pulse should be about 22 db higher than the TE, pulse. 
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TasLE II — CancuLaTEeD Prope Covupiine Factor, LiInE Loss anp 
Putsré Hretaut ror Eacu Mopr 

















Mod Mod Relative Probe 1.2 X Theoretical | Calculated Relativ 
raber Desienscgn OMBH ee Eevceor Linc Leet Pulse Heights db” 

1 Thy, 0 2.33 0 

2 TMa +0.32 4.88 —1.91 

3 TE +2.86 4.85 +3.20 

4 TMn +2.80 5.51 +2.42 

5 TE — 2 1.73 —2 

6 TEs; +4.82 8.21 +3.76 

7 TMa +1.82 6.92 —0.95 

8 Tha +6.80 13.86 +2.07 

9 TE, —8.73 4.70 —19.838 

10 TMo —1.68 7.74 —8.77 

11 M3; —0.82 12.71 —12.02 

12 TEs +10.14 32.09 —9.48 











Since the TE. pulse is so weak, it would not show on the trace even if 
it were resolved on a time basis. Coupling to the TMoz mode is rather 
weak, and the gain was increased somewhat at its position on the trace 
to show its time location. 


8. DELAY DISTORTION 


Another effect of the wide bandwidth of the pulses used with this 
equipment can be observed in Fig. 12. The pulses that have traveled 
for a longer time in the guide are in the modes closer to cutoff, and are 
on the right-hand side of the oscilloscope trace. They are broadened 
and disterted compared with the ones on the left-hand side. This effect 
is due to delay distortion in the guide. This can be explained by refer- 
ence to Fig. 13. On this figure the ratio of group velocity to the velocity 
in an unbounded medium is shown plotted as a function of frequency 
for each of the modes that can be propagated. The bandwidth of the 
transmitted pulse is indicated by the vertical shaded area. It will be 
noticed that the spacing of the pulses on the oscilloscope trace on Fig. 
12 from left to right in time corresponds to the spacing of the group 
velocity curves in the bandwidth of the pulse from top to bottom. De- 
lay distortion on these curves is shown by the slope of the line across 
the pulse bandwidth. If the line were horizontal, showing the same group 
velocity at all points in the band, there would be no delay distortion. 


The greater the difference in group velocity at the two edges of the - 


band, the greater the delay distortion. The curves of Fig. 13 indicate 
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that there should be increasing amounts of delay distortion reading 
from top to bottom for the pulse bandwidth used in these experiments. 
The effect of this delay distortion is to cause a broadening of the pulse. 
Examination of the pulse pattern of Fig. 12 shows that the later pulses 
corresponding in mode designation to‘the lower curves of Fig. 18 do in- 
deed show a broadening due to the increased delay distortion. One 
method of reducing the effect of delay distortion is to use frequency 
division multiplex so that each signal uses a smaller bandwidth. Another 
way, suggested by D. H. Ring, is to invert the band in a section of the 
waveguide between one pair of repeaters compared with that between 
an adjacent pair af repeaters so that the slope is, in effect, placed in the 
opposite direction, and delay distortion tends to cancel out, to a first 
order at least. 

The quantitative magnitude of delay distortion has been expressed 
by 8. Darlington in terms of the modulating base-band frequency 
needed to generate two side frequencies which suffer a relative phase 
error of 180° in traversing the line. This would cause cancellation of a 
single frequency AM signal, and severe distortion using any of the 
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Fig. 138 — Theoretical group velocity vs. frequency curves for the 3” diameter 
waveguide used for the tests shown on Fig. 12. The vertical shaded area gives the 
bandwidth for the millimicrosecond pulses employed in that arrangement. 
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ordinary modulation methods. Darlington gives this formula: 


1/2 2\3/4 
weir) “= 
where: ° 
fB = base bandwidth for 180° out of phase sidebands | 
= operating frequency (in same units as fB) 
= wavelength in air 
L = waveguide length (in same units as \) z 
Unm = A/Ne 
A. = cutoff wavelength for the mode involved 


With this equipment, the base bandwidth of the pulse is about 175 
mc, and when fB from the formula above is about equal to or less than 
this, pulse distortion should be observed. The following Table III gives 
f{B calculated from this formula for the arrangement shown on Fig. 12. 

It is interesting to note that pulses in the TMi, and TE, modes, for 
which fB is less than the 175-me pulse bandwidth, are broadened, but 
not badly distorted. For the higher modes, where fB is much less than 
175 mc, broadening and severe distortion are evident. Another example 
is given in the next section. 


9. DELAY DISTORTION EQUALIZATION 


If the distance which a pulse travels in a waveguide is increased, its 
delay distortion also increases. Since the group velocity at one edge of 
the band is different than at the other edge of the band, the amount 
by which the two edges get out of phase with each other increases with 
the total length of travel, causing increased distortion and pulse broaden- 
ing. The Darlington formula in the previous section shows that fB 
varies inversely as the square root of the length of travel. This effect 
is shown on Fig. 14. In this arrangement the transmitter was connected 
to the end of a 3” diameter round waveguide 107 feet long through a 
small hole in the end plate. A mode filter was used so that only the 
TE. mode would be transmitted in this waveguide. Through another 
small hole in the end plate polarized 90° from the first one, and rotated 
90° around the plate, a directional coupler was connected as shown. 
The direct through guide of this directional coupler could be short cir- 
cuited with a waveguide shorting switch. Energy reflected from this 
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Tas_e ITI — CatcuLtatep VALUES oF fB For THE ARRANGEMENT 
SHown IN Fia. 12 














Mode Number Mode Designation {B Megacycles Remarks 

1 TE, 324.0 

2 TMo 237.7 

3 TE 174.9 

4 TMy 124.1 

5 TE 124.1 Not excited 

6 TEs 105.2 

7 TMs 65.9 

8 TEa 59.1 

9 TE. 58 .6 Very weakly excited 
10 TMoe 51.8 
11 TMs: 21.3 Not observed 
12 TEs, 20.0 Not observed 
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Fig. 14 — The left-hand series of pulses shows the build up of delay distortion 
with increasing number of round trips in a long waveguide. The right-hand series 
shows the improvement obtained with the tapered delay distortion equalizer 
shown at the right. 
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switch was then taken through the directional coupler to the receiver 
as shown by the output arrow. The series of pulses at the left-hand 
photograph of the oscilloscope traces was taken with this waveguide 
shorting switch closed. The top pulse shows the direct leakage across 
the inside of the end plate before it has traveled through the 3” round 
guide. The next pulse is marked one round trip, having gone therefore 
214 feet in the TE: mode in the round waveguide. The successive pulses 
have traveled more round trips as shown by the number in the center 
between the two photographs. The effect of increased delay distortion 
broadening and distorting the pulse can be seen as the numbers increase. 
The values of fB from the Darlington formula in the previous section 
for these lengths are given in Table IV. 

It will be noticed that pulse broadening, and eventually severe dis- 
tortion, occurs as fB decreases much below the 175-me pulse band- 
width. The effect is gradual, and not too bad a pulse shape is seen until 
fB is about half the pulse bandwidth, although broadening is very 
evident earlier. 

When the waveguide short-circuiting switch was opened so that the 
tapered delay distortion equalizer was used to reflect the energy, in- 
stead of the switch, the series of pulses at the right was observed on 
the indicator. It will be noted that there is much less distortion of these 
pulses, particularly toward the bottom of the series. The ones at the top 
have less distortion than would be expected, probably because of fre- 
quency modulation of the injected pulse. The equalizer consists of a 
long gradually tapered section of waveguide which has its size reduced 
to a point beyond cutoff for the frequencies involved. Reflection takes 
place at the point of cutoff in this tapered guide. For the high frequency 
part of the pulse bandwidth, this point is farther away from the short- 
ing switch than for the low frequency part of the bandwidth. Conse- 
quently, the high frequency part of the pulse travels farther in one round 
trip into this tapered section and back than the low frequency part of 


TasBLe [IV — VauvuEs or fB FRoM THE DARLINGTON FORMULA 
FOR THE ARRANGEMENT SHOWN IN Fic. 14 








Round Trip Number JB Megacycles Round Trip Number SB Megacycles 
1 185.8 6 75.8 
2 131.4 7 70.2 
3 107.3 8 65.7 
4 92.9 9 61.9 
5 83.1 10 58.7 
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the pulse. This increased time of travel compensates for the shorter 
time of travel of the high frequency edge of the band in the 3” round 
waveguide, so equalization takes place. Since this waveguide close to 
cutoff introduces considerable delay distortion by itself, the taper effect 
must be made larger in order to secure the equalization. This can be 
done by making the taper sufficiently gradual. This type of equalizer 
introduces a rather high loss in the system. For this reason it might 
be used to predistort the signal at an early level in a repeater system. 
Equalization by this method was suggested by J. R. Pierce. 


10. MEASURING MODE CONVERSION FROM ISOLATED SOURCES 


One of the important uses of this equipment has been for the meas- 
‘urement of mode conversion. W. D. Warters has cooperated in develop- 
ing techniques and carrying out such measurements. One of the prob- 
lems in the design of mode filters used for suppressing all modes except 
the circular electric ones in round multimode guides is mode conversion. 
Since these mode filters have circular symmetry, conversion can take 
place only to circular electric modes of order higher than the TE; mode. 
This conversion is, however, a troublesome one, since these higher 
order circular modes cannot be suppressed by the usual type of filter. 

An arrangement for measuring mode conversion at such mode filters 
from the TEo: to the TEy, mode is being used with the short pulse equip- 
ment. This employs a 400-foot long section of the 5” diameter line. Be- 
cause the coupled-line transducer available had too high a loss to TE , a 
combined TE : — TE, transducer was assembled. It uses one-half of 
the round waveguide to couple to each mode. Fig. 15 shows this device. 

The use of this transducer and line is illustrated in Fig. 16. Pulses in 
the TE»: mode are sent into the waveguide by the upper section of the 
transducer as shown. Some of the TE: energy goes directly across to 
the TE: transducer and appears as the outgoing pulse with a level 
down about 32 db. This is useful as a time reference in the system and 
is shown as the outgoing pulse in the photo of the oscilloscope trace 
above. The main energy in the TE»: mode propagates down the line as 
shown by dashed line 2, which is the path of this wave. Most of 
this energy goes all the way to the reflecting piston at the far end and 
then returns to the TEo, transducer where it gives a pulse which is 
marked TE: round trip on the trace photograph above. Two thirds of 
the way from the sending end to the piston, the mode filter being meas- 
ured is inserted in the line. When the TE), mode energy comes to this _ 
mode filter, a small amount of it is converted to the TE. mode. This 


58 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 





Fig. 15 — A special experimental transducer for injecting the TE»; mode and 
receiving the converted TE. mode in a 5” diameter waveguide. 


continues to the piston by path 4 (with dashed lines and crosses) | 
and then returns and is received by the TE. part of the transducer. 
This appears on the trace photo as the TE. first conversion. When the 
main TE. energy reflected by the piston comes back to the mode filter, 
conversion again takes place to TE. This is shown by path 3 hav- 
ing dashed lines and circles. This returns to the TE, part of the trans- 
ducer and appears on the trace photo as the TE. second conversion. 
In addition, a small amount of energy in the TE: mode is generated 
by the TEo: upper part of the transducer. It is shown by path 5, having 


TEor TEo2 TEo2 TEo2 
ROUND SECOND FIRST ROUND 
OUTGOING PULSE TRIP CONVERSION CONVERSION TRIP 





TEg2 MODE FILTER 


Fic. 16 — Trace photos and waveguide paths traveled when measuring TE9, 
to TEo2 mode conversion at a mode filter with the transducer shown on Fig. 15. 
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short dashes. This goes down through the waveguide to the far end 
piston and back, and is received by the TE. transducer and shown as 
the pulse marked TE). round trip. The pulse marked TE round trip 
has a time separation from the outgoing pulse which is determined by 
the group velocity of TE, waves going one round trip in the guide. The 
TEo2 round trip pulse appears at a time corresponding to the group 
velocity of the TEo2 mode going one round trip in the guide. Spacing the 
mode filter two-thirds of the way down produces the two conversion 
pulses equally spaced between these two as shown in Fig. 16. The first 
conversion pulse appears at a time which is the sum of the time taken 
for the TEp: to go down to the filter and the TEx: to go from the filter 
to the piston and back to the receiver. Because of the slower velocity 
of the TE. , this appears at the time shown, since it was in the TEp: 
mode for a longer time than it was in the TE, mode. The second con- 
version, which happened when TE: came back to the mode filter, comes 
earlier in time than the first conversion, since the path for this signal 
was in the TE; mode longer than it was in the TEo. mode. This arrange- 
ment gives very good time separation, and makes possible a measure- 
ment of the amount of mode conversion taking place in the mode filters. 
Mode conversion from TE to TE»: as low as 50 to 55 db down, can be 
measured with this equipment. 

Randomly spaced single discontinuities in long waveguides can be 
located by this technique if they are separated far enough to give in- 
dividually resolved short pulses in the converted mode. Fig. 17 shows 


PAT. 
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FIRST CONVERSION AT FAR END SECOND CONVERSION 
AT NEAR END ~ SQUEEZED AT NEAR END 
SQUEEZED SECTION SECTION SQUEEZED SECTION 
TO RECEIVER 





Teo > TES TES <—> TEI9 NEAR END 250 FT OF FAR END 
TRANSDUCER COUPLED LINE SQUEEZED 3”°DIAMROUND SQUEEZED 
TRANSDUCER SECTION GUIDE SECTION 


Fig. 17 — Arrangement used to explain the measurement and location of mode 
conversion from isolated sources. A deliberately squeezed section was placed 
at each end of the long waveguide, producing the pulses shown in the trace photo. 
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an arrangement having oval sections deliberately placed’ in the wave- 
guide in order to explain the method. Pure THs excitation is used, and 
the converted TE,; mode observed with a coupled line transducer giv- 
ing an output for that mode alone. 

Let us consider first what would happen with the far-end squeezed 
section alone, omitting the near-end squeezed section from considera- 
tion. The injected TE : mode signal would then travel down the 250 
feet of 3” diameter round waveguide to the far end with substantially 
no mode conversion at the level being measured. At this point it goes 
through the squeezed section. Conversion now takes place from the TEo1 
mode to the TE.; mode. Both these modes after reflection from the piston 
travel back up the waveguide to the sending end. The group velocity 
of the TE: mode is higher than the group velocity of the TE: mode, so 
energy in these two modes separates, and if a coupling system were 
used to receive energy in both modes, two pulses would appear, with a 
time separation between them. In this case, since the receiver is con- 
nected to the line through the coupled line transducer which is responsive 
only to the TE. mode, only one pulse is seen, that due to this mode 
alone. This is the center pulse in the trace photograph at the top of 
Fig. 17. If only one mode conversion point at the far end of the guide 
exists, only this one pulse is seen at the receiver. It would be spaced a 
distance away from the injected outgoing pulse that corresponds in 
time to one trip of the TE: mode down to the far end and one trip of 
the TE,: mode from the far end back to the receiver. 

Now let us consider what would happen if the near-end squeezed sec- 
tion alone were present. When the TE: wave passes the oval section 
just beyond the coupled line transducer, conversion takes place, and 
the energy travels down the line in both the TEo; and the TE.; modes, 
at a higher group velocity in the TE,, mode. These two signals are re- 
flected by the piston at the far end and return to the sending end. The 
TE.; signal comes through the coupled line transducer and appears as 
the pulse at the left of the photo shown on Fig. 17. Now the TE: energy 
has lagged behind the TE» energy, and when it gets back to the near- 
end squeezed section, a second mode conversion takes place, and TE. 
mode energy is produced which comes through the coupled line trans- 
ducer and appears at the receiver at the time of the right hand pulse. 
The spacing between these two pulses is equal to the difference in round 
trip times between the two modes. 

In general, for a single conversion source occurring at any point in 
the line, two pulses will appear on the scope. The spacing between these 
pulses corresponds to the difference in group velocity between the modes 
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from the point of the discontinuity down to the piston at the far end, 
and then back to the discontinuity. If the discontinuity is at the far 
end, this time difference becomes zero, and a single pulse is seen. By 
making a measurement of the pulse spacing, the location of a single 
conversion point can be determined. 

In the arrangement illustrated in Fig. 17, two isolated sources of 
conversion existed. They were spaced far enough apart so that they 
were resolved by this equipment, and all three pulses were observed. 
The two outside pulses were due to the first conversion point. The center 
pulse was caused by the other squeeze, which was right at the reflecting 
piston. If this conversion point had, been located back some distance 
from the piston, it would have produced two conversion pulses whose 
spacing could be used to determine the location of the conversion point. 

The coupled-line transducers are calibrated for coupling loss by send- 
ing the pulse through a directional coupler into the branch normally 
used for the output to the receiver. This gives a return loss from the 
directional coupler equal to twice the transducer loss plus the round 
trip line loss. 


11. MEASURING DISTRIBUTED MODE CONVERSION IN LONG WAVEGUIDES 


Measurements of mode conversion from TE to a number of other 
modes have been made with 5” diameter guides using this equipment. 
The arrangement of Fig. 18 was set up for this purpose. This is the same 
as Fig. 17, except that a long taper was used at the input end of the 5” 
waveguide, and a movable piston installed at the remote end. 

One of the converted modes studied with this apparatus arrange- 
ment was the TMi mode, which is produced by bends in the guide. 
This mode has the same velocity in the waveguide as the TE: mode. 
Therefore energy components converted at different points in the line 
stay in phase with the injected TE mode from which they are converted. 
There is never any time separation between these modes, and a single 
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Fig. 18 — Arrangement used for measuring mode conversion in the 5” diameter 
waveguides at Holmdel. 
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narrow pulse like the transmitted one is all that appears on the indicator 
oscilloscope. It is not possible from this to get any information about 
the location or extent of the conversion points in the line. Moving the 
far end piston does not change the relative phases of the modes, so no 
changes are seen in indicator pattern or pulse level as the piston is 
moved. For the Holmdel waveguides, which are about 500 feet long, 
the total round trip TM;; mode converted level varies from 32 to 36 db 
below the input TE mode level over a frequency range from 8,800 to 
9,600 me per second. 

All the other modes have velocities that are different than that of 
the TE: mode. When mode conversion takes place at many closely 
spaced points along the waveguide, the pulses from the various sources 
overlap, and phasing effects take place. In general, a filled-in pulse 
much longer than the injected one is observed. The maximum possible, 
but not necessary, pulse length is equal to the difference in time re- 
quired for the TE: mode and the converted mode to travel the total 
waveguide length being observed. The phasing effects within the broad- 
ened pulse change its height and shape as a function of frequency and 
line length. 

Measurements of mode conversion from TE: to TEs; in these wave- 
guides illustrate distributed sources and piston phasing effects. The 
TEs, mode has a group velocity 1.4 per cent slower than the TE mode. 
For a full round trip in the 500-foot lines, assuming conversion at the 
imput end, this causes a time separation of about two and one half 
pulse widths between these two modes. The received pulse is about two 
and a half times as long as the injected pulse, indicating rather closely 
spaced sources over the whole line length. For one far-end piston posi- 
tion, the received pattern is shown as the upper trace in Fig. 19. As 
the piston is moved, the center depressed part of the trace gradually 





Fig. 19 — Received pulse patterns with the arrangement of Fig. 18 used for 
studying conversion to the TE3; mode. 
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rises until the pattern shown in the lower trace is seen. As the piston 
is moved farther in the same direction the trace gradually changes to 
have the appearance of the upper photo again. Moving the far-end 
piston changes the phase of energy on the return trip, and thus it can 
be made to add to, or nearly cancel out, conversion components that 
originated ahead of the piston. When the time separation becomes 
great enough to prevent overlapping in the pulse width, phasing effects 
cannot take place, therefore, the beginning and end of the spread-out 
received pulse are not affected by moving the piston. Energy converted 
at the sending end of the guide travels the full round trip to the piston 
and back in the slower TEs: mode, and thus appears at the latest time, 
which is at the right-hand end of the received pulse. Conversion at the 
piston end returns at the center of the pulse, and conversion on the 
return trip comes at earlier times, at the left-hand part of the pulse. 
The TH: mode has less loss in the guide than the TEs3, mode. Since the 
energy in the earlier part of the received pulse spent a greater part of 
the trip in the lower loss TE: mode before conversion, the output is 
higher here, and slopes off toward the right, where the later returning 
energy has gone for a longer distance in the higher loss mode. The pulse 
height at the maximum shows the converted energy from that part of 
the line to be between 30 and 35 db below the incident TE energy 
level over the measured bandwidth. 

Measurements of mode conversion from TE to TE; in these wave- 
guides show these same effects, and also a phasing effect as a function 
of frequency. The TE mode has a group velocity 2.4 per cent faster 
than the TE; mode. For a full round trip in the guides, this is a time 
separation of about four pulse widths between the modes. At one fre- 
quency and one far-end piston position, the TE,; response shown as the 
top trace of Fig. 20 was obtained. Moving the far-end piston gradually 
changed this to the second trace from the top, and further piston mo- 
tion changed it back again. This is the same kind of piston phasing effect 
observed in the TEs, mode conversion studies. The irregular top of this 
broadened pulse indicates fewer conversion points than for the TEs 
mode, or phasing effects along the guide length. Since the TE: mode 
has a higher group velocity than the TEo: mode, energy converted at 
the beginning of the guide returns at the earlier or left-hand part of the 
pulse, and conversions on the return trip, having traveled Jonger in 
the slower TE mode, are on the right-hand side of the pulse. This is 
just the reverse of the situation for the TEs: mode. Since the loss in the 
TE. mode is higher than in the TEo mode, the right side of this broad- 
ened pulse is higher than the left side, as the energy in the left side has 
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gone further in the higher loss TE,,; mode. Conversions from the piston 
end of the guide return in the center of the pulse, and only in this re- 
gion do piston phasing effects appear. As the frequency is changed the 
pattern changes, until it reaches the extreme shape shown in the next- 
to-the-bottom trace, with this narrower pulse coming at a time corre- 
sponding to the center of the broadened pulse at the top. Further fre- 
quency change in the same direction returns the shape to that of the 
top traces. At the frequency giving the received pulse shown on the 
next-to-the-bottom trace, moving the far-end piston causes a gradual 
change to the shape shown on the lowest trace. This makes it appear 
as if the mode conversion were coming almost entirely from the part of 
the guide near the piston end at this frequency. The upper traces appear 
to show that more energy is converted at the transducer end of the 
waveguide at that frequency. It would seem that at certain frequencies 
some phase cancellation is taking place between conversion points 
spaced closely enough to overlap within the pulse width. At frequencies 
between the ones giving traces like this, the appearance is more like 
that shown for the TE3; mode on Fig. 19 except for the slope across the 
top of the pulse being reversed. The highest part of this TE. pulse is 





Fig. 20 — Received pulse patterns with the arrangement of Fig. 18 used for 
studying conversion to the TE. mode. 
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24 to 27 db below the injected TE: pulse level for the 5” diameter 
Holmdel waveguides. 


12. CONCLUDING REMARKS 


The high resolution obtainable with this millimicrosecond pulse 
equipment provides information difficult to obtain by any other means. 
These examples of its use in waveguide investigations indicate the 
possibilities of the method in research, design and testing procedures. 
It is being used for many other similar purposes in addition to the illus- 
trations given here, and no doubt many more uses will be found for 
such short pulses in the future. 
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Experiments on the Regeneration of 
Binary Microwave Pulses 


By O. E. DeLANGE 
(Manuscript received September 7, 1955) 


A simple device has been produced for regenerating binary pulses directly 
at microwave frequencies. To determine the capabilities of such devices one 
of them was included in a circulating test loop in which pulse groups were 
passed through the device a large number of times. Results indicate that 
even tn the presence of serious noise and bandwidth limitations pulses can 
be regenerated many times and still show no noticeable deterioration. Pic- 
tures of circulated PEE are included which illustrate performance of the 
regenerator. 


INTRODUCTION 


The chief advantage of a transmission system employing binary pulses 
resides in the possibility of regenerating such pulses at intervals along 
the route of transmission to prevent the accumulation of distortion due 
to noise, bandwidth limitations and other effects. This makes it possible 
to take the total allowable deterioration of signal in each section of a 
long relay system rather than having to make each link sufficiently good 
to prevent total accumulated distortion from becoming excessive. This 
has been pointed out by a number of writers.!? 

W. M. Goodall* has shown the feasibility of transmitting television 
signals in binary form. Such transmission requires a considerable amount 
of bandwidth; a seven digit system, for example, would require trans- 
mission of seventy million pulses per second. This need for wide bands 
makes the microwave range an attractive one in which to work. 8. E. 
Miller* has pointed out that a binary system employing regeneration 
might prove to be especially advantageous in waveguide transmission. 

1B.M. Oliver, J. R. Pierce and, C. KE. Shannon, The Philosophy of PCM, Proc. 
I. R. E., Nov., 1948 

2L. A. Meacham and E. Peterson, An Experimental Multichannel Pulse Code 
Modulation System of Toll Quality, BS. 1: J., Jan. 1948. 


3W.M. Goodall, Television by Pulse Code Modulation, B.S. T.J., Jan., 1951. 
48. I. Miller, Waveguide as a Communication Medium, "B.S. T. J. , Nov. "1954. 
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Fig. 1 — A typical regenerative repeater shown in block form. 
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That the Bell System is interested in the long-distance transmission 
of television and other broad-band signals is evident from the number 
of miles of such broad-band circuits, both coaxial cable and microwave 
radio,® now in service. These circuits provide high-grade transmission 
because each repeater was designed to have a very flat frequency charac- 
teristic and linear phase over a considerable bandwidth. Furthermore, 
these characteristics are very carefully maintained. For a binary pulse 
system employing regeneration the requirements on flatness of band and 
linearity of phase can be relaxed to a considerable degree. The compo- 
nents for such a system should, therefore, be simpler and less expensive 
to build and maintain. Reduced maintenance costs might well prove to 
be the chief virtue of the binary system. 

Since the chief advantage of a binary system lies in the possibility of 
regeneration it is obvious that a very important part of such a system is 
the regenerative repeater employed. Fig. 1 shows in block form a typical 
broad-band, microwave repeater. Here the input, which might come from 
either a radio antenna or from a waveguide, is first passed through a 
proper microwave filter then amplified, probably by a traveling-wave 
amplifier. The amplified pulses of energy are regenerated, filtered, am- 
plified and sent on to the next repeater. The experiment to be described 
here deals primarily with the block labeled ‘‘Regenerator”’ on Fig. 1. 

In these first experiments one of our main objectives was to keep the 
repeater as simple as possible. This suggests regeneration of pulses 
directly at microwave frequency, which for this experiment was chosen 
to be 4 kme. It was suggested by J. R. Pierce and W. D. Lewis, both of 
Bell Telephone Laboratories, that further simplification might be made 
possible by accepting only partial instead of complete regeneration. 
This suggestion was adopted. 

For the case of complete regeneration each incoming pulse inaugurates 
a new pulse, perfect in shape and correctly timed to be sent on to the 


5 A. A. Roetken, K. D. Smith and R. W. Friis, The TD-2 System, B.S. T. J., 
Oct., 1951, Part II. 
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next repeater. Thus noise and other disturbing effects are completely 
eliminated and the output of each repeater is identical to the original 
signal which entered the system. For the case of partial regeneration 
incoming pulses are retimed and reshaped only as well as is possible with 
simple equipment. Obviously the difference between complete and partial 
regeneration is one of degree. 

One object of the experiment was to determine how well such a partial 
regenerator would function and what price must be paid for employing 
partial instead of complete regeneration. The regenerator developed 
consists simply of a waveguide hybrid junction with a silicon crystal 
diode in each side arm. It appears to meet the requirement of simplicity 
in that it combines the functions of amplitude slicing and pulse retiming 
in one unit. A detailed description of this unit will be given later. Al- 
though the purpose of this experiment was to determine what could be 
accomplished in a very simple repeater we must keep in mind that 
superior performance would be obtained from a regenerator which ap- 
proached more nearly the ideal. For some applications the better re- 
generator might result in a more economical system even though the 
regenerator itself might be more complicated and more expensive to 
produce. 


METHOD OF TESTING 


The regeneration of pulses consists of two functions. The first function 
is that of removing amplitude distortions, the second is that of restoring 
each pulse to its proper time. The retiming problem divides into two 
parts the first of which is the actual retiming process and the second 
that of obtaining the proper timing pulses with which to perform this 
function. In a practical commercial system timing information at a 
repeater would probably be derived from the incoming signal pulses. 
There are a number of problems involved in this recovery of timing 
pulses. These are being studied at the present time but were avoided in 
the experiment described here by deriving such information from the 
local synchronizing gear. , 

Since the device we are dealing with only partially regenerates pulses 
it is not enough to study the performance of a single unit — we should 
like to have a large number operating in tandem so that we can observe 
what happens to pulses as they pass through one after another of these 
regenerators. To avoid the necessity of building a large number of units 
the pulse circulating technique of simulating a chain of repeaters was 
employed. Fig. 2 shows this circulating loop in block form. 
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Fig. 2 — The circulating loop. 


To provide RF test pulses for this loop the output of a 4 kmec, cw 
oscillator is gated by baseband pulse groups in a microwave gate or 
modulator. The resultant microwave pulses are fed into the loop (heavy 
line) through hybrid junction No. 1. They are then amplified by a trav- 
eling-wave amplifier the output of which is coupled to the pulse regen- 
erator through another hybrid junction (No. 2). The purpose of this 
hybrid is to provide a position for monitoring the input to the regen- 
erator. A monitoring position at the output of the regenerator is pro- 
vided by a third hybrid, the main output of which feeds a considerable 
length of waveguide which provides the necessary loop delay. At the far 
end of the waveguide another hybrid (No. 4) makes it possible to feed 
noise, which is derived from a traveling-wave amplifier, into the loop. 
The combined output after passing through a band pass filter is ampli- 
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fied by another traveling-wave amplifier and fed back into the loop in- 
put thus completing the circuit. 

The synchronizing equipment starts out with an oscillator going at 
approximately 78 ke. A pulse generator is locked in step with this os- 
cillator. The output of the pulser is a negative 3 microsecond pulse as 
shown in Fig. 3A. After being amplified to a level of about 75 volts 
this pulse is applied to the helix of the first traveling-wave tube to re- 
duce the gain of this tube during the 3-microsecond interval. Out of each 
12.8usec interval pulses are allowed to circulate for 9.8usec but are blocked 
for the remaining 3ysec thus allowing the loop to return to the quiescent 
condition once during each period as shown on Figs. 3A and 3C. 

The 8yusec pulse also synchronizes a short-pulse generator. This unit 
delivers pulses which are about 25 millimicroseconds long at the base 
and spaced by 12.8yusec, i.e., with a repetition frequency of 78 ke. See 
Fig. 3B. 

In order to simulate a PCM system it was decided to circulate pulse 
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Fig. 3 — Timing events in the circulating loop. 
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groups rather than individual pulses through the system. These were 
derived from the pulse group generator which is capable of delivering 
any number up to 5 pulses for each short input pulse. These pulses are 
about 15 milli-microseconds long at the base and spaced 25 milli-micro- 
seconds apart. The amplitude of each of these pulses can be adjusted 
independently to any value from zero to full amplitude making it pos- 
sible to set up any combination of the five pulses. These are the pulses 
which are used to gate, or modulate, the output of the 4-kme oscillator. 

The total delay around the waveguide loop including TW tubes, etc., 
was 0.4usec or 400 milli-microseconds. This was sufficient to allow time 
between pulse groups and yet short enough that groups could circulate 
24 times in the available 9.8usec interval. This can be seen from Figs. 
3C and 3D. The latter figure shows an expanded view of circulating 
pulse groups. The pulses in Group I are inserted into the loop at the 
beginning of each gating cycle, the remaining groups result from circu- 
lation around the loop. 

When all five pulses are present in the pulse groups the pulse repeti- 
tion frequency is 40 me. (Pulse interval 25 milli-microseconds). For this 
condition timing pulses should be supplied to the regenerator at the rate 
of 40 million per second. These pulses are supplied continuously and not 
in groups as is the case with the circulating pulses. See Fig. 3E. In order 
to maintain time coincidence between the circulating pulses and the tim- 
ing pulses the delay around the loop must be adjusted to be an exact 
multiple of the pulse spacing. In this experiment the loop delay is equal 
to 16-pulse intervals. Since timing pulses are obtained by harmonic 
generation from the quenching frequency as will be discussed later this 
frequency must be an exact submultiple of pulse repetition frequency.. 
In this experiment the ratio is 512 to 1. 

Although the above discussion is based on a five-pulse group and 
40-me repetition frequency it turned out that for most of the experi- 
ments described here it was preferable to drop out every other pulse, 
leaving three to a group and resulting in a 20-mc repetition frequency. 
The one exception to this is the limited-band-width experiment which 
will be described later. 

For all of the experiments described here timing pulses were derived 
from the 78-ke quenching frequency by harmonic generation. A pulse 
with a width of 25 milli-microseconds and with a 78-kc repetition fre- 
quency as shown in Fig. 3B supplied the input to the timing wave gen- 
erator. This generator consists of several stages of limiting amplifiers all 
tuned to 20 me, followed by a locked-in 20-me oscillator. The output of 
the amplifier consists of a train of 20-me sine waves with constant ampli- 
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tude for most of the #2.8usec period but falling off somewhat at the end 
of the period. This train locks in the oscillator which oscillates at a con- 
stant amplitude over the whole period and at a frequency of 20 me. 
Timing pulses obtained from the cathode circuit of the oscillator tube 
provided the timing waves for most of the experiments. For the experi- 
ment where a 40-me timing wave was required it was obtained from the 
20 mc train by means of a frequency doubler. For this case it is necessary 
for the output of the timing wave generator to remain constant in ampli- 
tude and fixed in phase for the 512-pulse interval between synchronizing 
pulses. 

In spite of the stringent requirements placed upon the timing equip- 
ment it functioned well and maintained synchronism over adequately 
long periods of time without adjustment. 


PERFORMANCE OF REGENERATOR 


Performance of the regenerator under various conditions is recorded 
on the accompanying illustrations of recovered pulse envelopes. The 
first experiment was to determine the effects of disturbances which arise 
at only one point in a system. Such effects were simulated by adding 
disturbances along with the group of pulses as they were fed into the 
circulating loop from the modulator. This is equivalent to having them 
occur at only the first repeater of the chain. 

Some of the first experiments also involved the use of extraneous 
pulses to represent noise or distortion since these pulses could be syn- 
chronized and thus studied more readily than could random effects. In 
Fig. 4A the first pulse at the left represents a desired digit pulse with 
its amplitude increased by a burst of noise, the second pulse represents 
a clean digit pulse, and the third pulse a burst of noise. This group is at 
the input to the regenerator. Fig. 4B shows the same group of pulses 
after traversing the regenerator once. The pulses are seen to be shortened 
due to the gating, or retiming, action. There is also seen to be some ampli- 
tude correction, i.e. the two desired pulses are of more nearly the same 
amplitude and the undesired pulse has been reduced in relative ampli- 
tude. After a few trips through the regenerator the pulse group was 
rendered practically perfect and remained so for the rest of the twenty- 
four trips around the loop. Fig. 4C shows the group after 24 trips. In 
another experiment pulses were circulated for 100 trips without deteri- 
oration. Nothing was found to indicate that regeneration could not be 
repeated indefinitely. 

Figs. 5A and 5B represent the same conditions as those of 4A and 4B 
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Fig. 4 — Effect of regeneration on disturbances which occur at only one re- 
peater. A — Input to regenerator, original signal. B — Output of regenerator, 
first trip. C — Output of regenerator, 24th trip. 





Fig. 5 — Effect of regeneration on disturbances which occur at only one re- 
peater. A — Input to regenerator, first four groups. B — Output of regenerator, 
first four groups. C — Output of regenerator, increased input level. 
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Fig. 6 — Effect of regeneration on disturbances which occur at only one re- 
peater. A — Input to regenerator, original signal. B — Output of regenerator, 
first trip. C — Output of regenerator, 24th trip. 


except that the oscilloscope sweep has been contracted in order to show 
the progressive effects produced by repeated passage of the signal through 
the regenerator. Fig. 5B shows that after the pulses have passed through 
the regenerator only twice all visible effects of the disturbances have 
been removed. Fig. 5C shows the effect of simply increasing the RF 
pulse input to the regenerator by approximately 4 db. The small “noise’’ 
pulse which in the previous case was quickly dropped out because of 
being below the slicing level has now come up above the slicing level 
and so builds up to full amplitude after only a few trips through the 
regenerator. Note that in the cases shown in Figs. 4 and 5 discrimination 
against unwanted pulses has been purely on an amplitude basis since 
the gate has been unblocked to pulses with amplitudes above the slicing 
level whenever one of these disturbing pulses was present. 

For Fig. 6A conditions are the same as for Fig. 4A except that an ad- 
ditional pulse has been added to simulate intersymbol noise or inter- 
ference. Fig. 6B indicates that after only one trip through the regenerator 
the effect of the added pulse is very small. After a few trips the effect 
is completely eliminated leaving a practically perfect group which con- 
tinues on for 24 trips as shown by Fig. 6C. For the intersymbol pulse, 
discrimination is on a time basis since this interference occurs at a time 
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Fig. 7 — Effect of a aac in amplitude without retiming. A — Output of 
regenerator, no timing, firt trip. B — Output of regenerator, no timing, 10th trip. 
Output of regenerator, no timing, 23rd trip. 


when no gating pulse is present and hence finds the gate blocked regard- 
less of amplitude. 

To show the need for retiming the pictures shown on Figs. 7 and 8 
were taken. These were taken with the amplitude slicer in operation but 
with the pulses not being retimed. Figs. 7A, 7B and 7C, respectively, 
show the output of the slicer for the first, tenth and twenty-third trips. 
After ten trips, there is noticeable time jitter caused by residual noise 
in the system; after 23 trips this jitter has become severe though pulses 
are still recognizable. It should be pointed out that for this experiment 
no noise was purposely added to the system and hence the signal-to- 
noise ratio was much better than that which would probably be encoun- 
tered in an operating system. For such a system we would expect time 
jitter effects to build up much more rapidly. For Fig. 8 conditions are 
the same as for Fig. 7 except that the pulse spacing is decreased by the 
addition of an extra pulse at the input. Now, after ten trips, time jitter 
is bad and after 23 trips the pulse group has become little more than a 
smear. This increased distortion is probably due to the fact that less 
jitter is now required to cause overlap of pulses. There may also be some 
effects due to change of duty cycle. For Fig. 9 there was neither slicing 
nor retiming of pulses. Here, pulse groups deteriorate very rapidly to 
nothing more than blobs of energy. Note that there is an increase of 
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Fig. 8 — Effect of regenerating in amplitude without retiming. A — Output of 
regenerator, no timing, first trip. B — Output of regenerator, no timing, 10th 
trip. C — Output of regenerator, no timing, 23rd trip. 





Fig. 9 — Pulses circulating through the loop without regeneration. A — Origi- 
nal input. B — 4th trip without regeneration. C — 20th to 24th trip without re- 
generation. 
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Fig. 10 — The regeneration of band-limited pulses. A — Input to regencrator, 
first two groups. B — Output of regenerator, first two groups. C — Output of 
regenerator, 24th trip. 


amplitude with each trip around the loop indicating that loop gain was 
slightly greater than unity. Without the slicer it is difficult to set the 
gain to exactly unity and the amplitude tends to either increase or de- 
crease depending upon whether the gain is greater or less than unity. 
Results indicated by the pictures of Fig. 9 are possibly not typical of a 
properly functioning system but do show what happened in this par- 
ticular system when regeneration was dispensed with. 

Another important function of regeneration is that of overcoming 
band-limiting effects. Figs. 10 and 11 show what can be accomplished. 
For this experiment the pulse groups inserted into the loop were as shown 
at the left in Fig. 10A. These pulses were 15 milli-microseconds wide at 
the base and spaced by 25 milli-microseconds which corresponds to a 
repetition frequency of 40 mc. After passing through a band-pass filter 
these pulses were distorted to the extent shown at the right in Fig. 10A. 
From the characteristic of the filter, as shown on Fig. 12, it is seen that 
the bandwidth employed is not very different from the theoretical min- 
imum required for double sideband transmission. This minimum char- 
acteristic is shown by the dashed lines on Fig. 12. Fig. 10B shows that 
at the output of the regenerator the effects of band limiting have been 
removed. This is borne out by Fig. 10C which shows that after 24 trips 
the code group was still practically perfect. It should be pointed out 
that the pulses traversed the filter once for each trip around the loop, 
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Fig. 11 — The regeneration of band-limited pulses. A — Input to regenerator, 
first two groups. B — Output of regenerator, first two groups. C — Output of re- 
generator, 24th trip. 


that isfor each trip the input to the regenerator was as shown at the right 
of Fig. 10A and the output as shown by Fig. 10B. It is important to 
note that Fig. 12 represents the frequency characteristic of a single link 
of the simulated system. The pictures of Fig. 11 show the same experi- 
ment but this time with a different code group. Any code group which 
we could set up with our five digit pulses was transmitted equally well. 

In order to determine the breaking point of the experimental system, 
broad-band noise obtained from a traveling-wave amplifier was added 
into the system as shown on Fig. 2. The breaking point of the system is 
the noise level which is just sufficient to start producing errors at the 
output of the system.* The noise is seen to be band-limited in exactly 
the same way as the signal. With the system adjusted to operate properly 
the level of added noise was increased to the point where errors became 
barely discernible after 24 trips around the loop. Noise level was now 
reduced slightly (no errors discernible) and the ratio of rms signal to rms 
noise measured. Fig. 13A shows the input to the regenerator for the 23rd 
and 24th trips with this amount of noise added. Note that the noise has 

* The type of noise employed has a Gaussian amplitude distribution and there- 
fore there was actually no definite breaking point — the rate at which errors oc- 
curred increased continuously as noise amplitude was increased. The breaking 
point was taken as the noise level at which errors became barely discernible on 


the viewing oscilloscope. More accurate measurements made in other experiments 
indicate that this is a fairly satisfactory criterion. 
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Fig. 12 — Characteristics of the band-pass microwave filter. 





Fig. 13. — The regeneration of pulses in the presence of broad-band, random 
noise added at each repeater. A — Input to regenerator, 23rd and 24th trips, 
broad-band noise added. B — Input to regenerator, 23rd and 24th trips, no added 
noise. C — 20-me timing wave. 
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Fig. 14 — The regeneration of pulses in the presence of interference occurring 
at each repeater. A — Original signal with added modulated carrier interference. 
B — Input to regenerator, 24th trip, modulated carrier interference. C — Output 
of regenerator, 24th trip, modulated carrier interference. 


produced a considerable broadening of the oscilloscope trace. Fig. 13B 
shows the same pulse groups with no added noise. These photographs are 
included to give some idea as to how bad the noise was at the breaking 
point of the system. Of course maximum noise peaks occur rather infre- 
quently and do not show on the photograph. At the output of the re- 
generator effects due to noise were barely discernible. This output looked 
so much like that shown at Fig. 14C that no separate photograph is 
shown for it. 3 

Figs. 14A, 14B and 14C show the effects of a different type of inter- 
ference upon the system. This disturbance was produced by adding into 
the system a carrier of exactly the same frequency as the signal carrier 
(4 kme) but modulated by a 14-mc wave, a frequency in the same order 
as the pulse rate. Here again the level of the interference was adjusted 
to be just below the breaking point of the system. A comparison between 
Figs. 14B and 14C gives convincing evidence that the regenerator has 
substantially restored the waveform. 

For the case of the interfering signal a ratio of signal to interference 
of 10 db on a peak-to-peak basis was measured when the interference 
was just below the breaking point of the system. This, of course, is 4 db 
above the theoretical value for a perfect regenerator. For the case of 
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broad-band random noise an rms signal to noise ratio of 20 db was meas- 
ured.* This compares with a ratio of 18 db as measured by Messrs. 
Meacham and Peterson for a system employing complete regencration 
and a single repeater. 

Recently, A. I*. Dietrich repeated the circulating loop experiment at 
a radio frequency of 11 kme. His determinations of required signal-to- 
noise ratios are sukstantially the same as those reported here. From the 
various experiments we conclude that for a long chain of properly func- 
tioning regenerative repeaters of the type discussed here practically 
perfect transmission is obtained as long as the signal-to-noise ratio at 
the input to each repeater is 20 db or better on an rms basis. In an operat- 
ing system it might be desirable to increase this ratio to 23 db to take 
care of deficiencies in automatic gain controls, power changes, ete. 

From the experiments we also conclude that the price we pay for using 
partial instead of complete regeneration is about 3 to 4 db increase in 
the required signal-to-noise ratio. In a radio system which provides a 
fading margin this penalty would be less since the probability that two 
or more adjacent links will reach maximum fades simultaneously is very 
small. Under these conditions only one repeater at a time would be near 
the breaking point and the system would behave much as though the 
repeater provided complete regeneration. 


TIMING 


Although we have considered the problem of retiming of signal pulses 
up to now we have not discussed the problem of obtaining the necessary 
timing pulses to perform this function, but have simply assumed that a 
source of such pulses was available. As was mentioned earlier timing 
pulses would probably be derived from the signal pulses in a practical 
system. These pulses would be fed into some narrow band amplifier 
tuned to pulse repetition frequency. The output of this circuit could be 
made to be a sine wave at repetition frequency if gaps between the input 
pulses were not too great. Timing pulses could be derived from this sine 
wave. This timing equipment could be similar to that used in these ex- 
periments and described earlier. Further study of the problems of ob- 
taining timing information is being made. 

* For Gaussian noise it is not possible to specify a theoretical value of minimum 
S/N ratio without specifying the tolerable percentage of errors. For the number of 
errors detectable on the oscilloscope it seems rasonable to assume a 12 db peak 


factor for the noise. The peak factor for the signal is 3 db. The 6 db peak S/N 
which would be required for an ideal regenerator then becomes 15 db on an rms 


basis. 
7+ L. A. Meacham and E. Peterson, B.S. T.J., p. 48, Jan., 1948. 
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Fig. 15A — Low-frequency equivalent of the partial regenerator. 





DESCRIPTION OF REGENERATOR 


This device regenerates pulses by performing on them the operations 
of “slicing” and retiming. ; 

An ideal slicer is a device with an input-output characteristics such as 
shown by the dashed lines of Fig. 15C. It is seen that for all input levels 
below the so-called slicing level transmission through the device is zero 
but that for all amplitudes greater than this value the output level is 
finite and constant. Thus, all input voltages which are less than the slic- 
ing level have no effect upon the output whereas all input voltages 
greater than the slicing level produce the same amplitude of output. 
Normally conditions are adjusted so that the slicing level is at one-half 
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Fig. 15B — Characteristics of the separate branches with differential bias. 
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Fig. 15C — Resultant output with differential bias. 
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Fig. 15D — Characteristics of the separate branches and resultant output with 
equal biases. 


of peak pulse amplitude — then at the output of the slicer there will be 
no effect whatsoever from disturbances unless these disturbances execed 
half of the pulse amplitude. It is this slicing action which removes the 
amplitude effects of noise. Time jitter effects are removed by retiming, 
i.e., the device is made to have high loss regardless of input level except 
at those times when a gating pulse is present. 

Fig. 15A shows schematically a low-frequency equivalent of the re- 
generator used in these experiments. Here an input line divides into two 
identical branches isolated from each other and each with a diode shunted 
across it. The outputs of the two branches are recombined through neces- 
sary isolators to form a single output. The phase of one branch is re- 
versed before recombination, so that the final output is the difference 
between the two individual outputs. 

Fig. 15B shows the input-output characteristics of the two branches 
when the diodes are biased back to be non-conducting by means of bias 
voltages V, and V2 respectively. For low levels the input-output char- 
acteristic of both branches will be linear and have a 45° slope. As soon 
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as the input voltage in a branch reaches a value equal to that of the back 
bias the diode will start to conduct, thus absorbing power and decrease 
the slope of the characteristic. The output of Branch 1 starts to flatten 
off when the input reaches the value V; , while the output of Branch 2 
does not flatten until the input reaches the value V2. The combined 
output, which is equal to the differences of the two branch outputs, is 
then that shown by the solid line of Fig. 15C and is seen to have a transi- 
tion region between a low output and a high output level. If the two 
branches are accurately balanced and if the signal voltage is large com- 
pared to the differential bias V2 — V, the transition becomes sharp and 
the device is a good slicer. 

If the two diodes are equally biased as shown on Fig. 15D the outputs 
of the two branches should be nearly equal regardless of input and the 
total output, which is the difference between the two branch outputs, 
will always be small. 

Fig. 16 shows a microwave equivalent of the circuit of Fig. 15A. In 
the microwave structure lengths of wave-guide replace the wire lines and 
branching, recombining and isolation are accomplished by means of 
hybrid junctions. The hybrid shown here is of the type known as the 1A 
junction. 

Fig. 17 shows another equivalent microwave structure employing only 
one hybrid. This is the type used in the experiments described here. The 
output consists of the combined energies reflected from the two side 
arms of the junction. With the junction connected as shown phase rela- 
tionships are such that the output is the difference between the reflec- 
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Fig. 16 — Microwave regenerator. 
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tions from the two side arms so that when conditions in the two arms 
are identical there is no output. The crystal diodes coupled to the side 
arms are equivalent to those shunted across the two lines of Fig. 15A. 

Fig. 18, which is a plot of the measured input-output characteristic 
of the regenerator used in the loop test, shows how the device acts as a 
combined slicer and retimer. Curve A, obtained with equal biases on the 
two diodes, is the characteristic with no gating pulse applied i.e. the 
diodes are normally biased in this manner. It is seen that this condition 
produces the maximum of loss through the device. By shifting one diode 
bias so as to produce a differential of 0.5 volt the characteristic changes 
to that of Curve B. This differential bias can be supplied by the timing 
pulse in such a way that this pulse shifts the characteristic from that 
shown at A to that shown at B thus decreasing the loss through the de- 
vice by some 12 to 15 db during the time the pulse is present. In this way 
the regenerator is made to act as a gate — though not an ideal one. 

We see from curve B that with the differential bias the device has the 
characteristic of a slicer — though again not ideal. For lower levels of 
input there is a region over which the input-output characteristic is 
square law with a one db change of input producing a two db change of 
output. This region is followed by another in which limiting is fairly 
pronounced. At the 8-db input level, which is the point at which limiting 
sets in, the loss through the regenerator was measured to be approxi- 
mately 12 db. The characteristic shown was found to be reproducible 
both in these experiments at 4 kme and in those by A. I’. Dietrich at 
11 kme. | 

For a perfect slicer only an infinitesimal change of input level is re- 
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Fig. 17 — Microwave regenerator employing a single hybrid junction. 
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Fig. 18 — Static characteristics of the regenerator employed in these experiments. 


quired to change the output from zero to maximum. The input level at 
which this transition takes place is the slicing level and has a very defi- 
nite value. For a characteristic such as that shown on Fig. 18 this point 
is not at all definite and the question arises as to how one determines the 
slicing level for such a device. Obviously this point should be somewhere 
on the portion of the characteristic where expansion takes place. In the 
case of the circulating loop the slicing level is the level for which total 
gain around the loop is exactly equal to unity. Why this is so can be seen 
from Fig. 19 which is a plot of gain versus input level for a repeater 
containing a slicer with a characteristic as shown by curve B of Fig. 18. 
Amplifiers are necessary in the loop to make up for loss through the re- 
generator and other components. For Fig. 19 we assume that these 
amplifiers have been adjusted so that gain around the loop is exactly 
unity for an input pulse having a peak amplitude corresponding to the 
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Fig 19 — Gain characteristics of a repeater providing partial regeneration. 


point P’ of Fig. 18. On Fig. 19 all other levels are shown in reference to 
this unity-gain value. — 

From Fig. 19 it is obvious that a pulse which starts out in the loop 
with a peak amplitude exactly equal to the reference, or slicing level, 
will continue to circulate without change of amplitude since for this 
level there is unity gain around the loop. A pulse with amplitude greater 
than the slicing level will have its amplitude increased by each passage 
through a regenerator until it eventually reaches a value of +6 db. It 
will continue to circulate at this amplitude, for here also the gain around 
the loop is unity.* Any pulse with peak amplitude less than the reference 
‘level will have its amplitude decreased by successive trips through the 
regenerator and eventually go to zero. We also see that the greater the 
departure of the amplitude of a pulse from the slicing level the more 
effect the regenerator has upon it. This means that the device acts much 
more powerfully on low level noise than on noise with pulse peaks near 
the slicing level. As examples consider first the case of noise peaks only 
1 db below slicing level at the input (peak S/N = 7 db). At this level 
there is a 1 db loss through the repeater so that at the output the noise 
peaks will be 2 db below reference to give a S/N ratio of 8 db. Next 


* Note that the +6-db level is at a point of stable equilibrium whereas at the 
slicing level equilibrium is unstable. 
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consider noise with a peak level 5 db below slicing level (S/N = 11 db) 
at the input. The loss at this level is 5 db resulting in a noise level 10 db 
below reference to give a S/N ratio of 16 db. We see that a 4 db improve- 
ment in S/N ratio at the input results in an 8 db improvement in this 
ratio at the output. 

Everything which was said above concerning the circulating loop ap- 
plies equally to a chain of identical repeaters. To set the effective slicing 
level at half amplitude at each repeater in a chain one would first find 
two points on the slicer characteristics such as P and P’ of Fig. 18. The 
point P should be in the region of expansion and P’ in the limiting region. 
Also the points should be so chosen that a 6 db increase of input from 
that at point P results in a 6 db increase in output at the point P’. If 
now at each repeater we adjust pulse peak amplitude at the slicer input 
to a value corresponding to that at point P’ we will have unity gain 
from one repeater to the next at levels corresponding to pulse peaks. 
We will also have unity gain at levels corresponding to one half of pulse 
amplitude. The effective slicing level is thus set at half amplitude. Ob- 
viously the procedure for setting the slicing level at some value other 
than half amplitude would be practically the same. It should be pointed 
out that although half amplitude is the preferred slicing level for base- 
band pulses this is not the case for carrier pulses. W. R. Bennett of Bell 
Telephone Laboratories has shown that for carrier pulses the probability 
that noise of a given power will reduce signal pulses below half amplitude 
is less than the probability that this same noise will exceed half ampli- 
tude. This comes about from the fact that for effective cancellation there 
must be a 180° phase relationship between noise and pulse carrier. For 
this reason the slicing level should be set slightly above half amplitude 
for a carrier pulse system. 

The difference in performance between a perfect slicer and one with 
characteristics such as shown on Fig. 18 are as follows: For the perfect 
slicer no effects from noise or other disturbances are passed from one 
repeater to the next. For the case of the imperfect regenerator some ef- 
fects are passed on and so tend to accumulate in a chain of repeaters. 
To prevent this accumulated noise from building up to the breaking 
point of the system it is necessary to make the signal-to-noise ratio at 
each repeater somewhat better than that which would be required with 
the ideal slicer. For the case of random noise the required S/N ratio 
seems to be about 5 or 6 db above the theoretical value. This is due in 
part to slicer deficiency and in part to other system imperfections. 
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CONCLUSIONS 


It is possible to build a simple device for regenerating pulses directly 
at microwave frequencies. A long chain of repeaters employing this 
regenerator should perform satisfactorily as long as the rms signal-to- 
noise ratio at each repeater is maintained at a value of 20 db or greater. 
There are a number of remaining problems which must be solved before 
we have a complete regenerative repeater. Some of these problems are: 
(1) Recovery of information for retiming from the incoming pulse train; 
(2) Automatic gain or level control to set the slicing level at each re- 
peater; (3) Simple, reliable, economical, broad-band microwave ampli- 
fiers. (4) Proper filters — both for transmitting and receiving. Traveling- 
wave tube development should eventually result in amplifiers which 
will meet all of the requirements set forth in (3) above. Any improve- 
ments which can be made in the regenerator without adding undue 
complications would also be advantageous. 
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| Crossbar Tandem as a Long Distance 
Switching System 


By A. 0. ADAM 
(Manuscript received March 4, 1955) 


Major toll switching features are being added to the crossbar tandem 
switching system for use at many of the important long distance switching 
centers of the nationwide network. These include automatic selection of one 
of several alternate routes to a particular destination, storing and sending 
forward digits as required, highly flexible code conversion for transmitting 
digits different from those received, and a translating arrangement to select 
the most direct route to a destination. The system ts designed to serve both 
operator and customer dialed long distance traffic. 


INTRODUCTION 


The crossbar tandem switching system,! originally designed for switch- 
ing between local dial offices, will now play an important role in nation- 
wide dialing. New features are now available or are being developed that 
will permit this system to switch all types of traffic. As a result, crossbar 
tandem offices will have widespread use at many of the important switch- 
ing centers of the nationwide switching network. 

This paper briefly reviews the crossbar tandem switching system and 
its application for local switching, followed by discussion of the general 
aspects of the nationwide switching plan and of the major new features 
required to adapt crossbar tandem to this plan. 


CROSSBAR TANDEM OFFICES USED FOR LOCAL SWITCHING 


Crossbar tandem offices are now used in many of the large metropolitan 
areas throughout the country for interconnecting all types of local dial 
offices. In these applications they perform three major functions. Basi- 
cally, they permit economies in trunking by combining small amounts of 
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traffic to and from the local offices into larger amounts for routing over 
common trunk groups to gain increased efficiency resulting in fewer over- 
all trunks. 

A second important function is to permit handling calls economically 
between different types of local offices which are not compatible from the 
standpoint of intercommunication by direct pulsing. Crossbar tandem 
offices serve to connect these offices and to supply the conversion from 
one type of pulsing to another where such incompatibilities exist. 

The third major function is that of centralization of equipment or 
services. For example, centralization of expensive charging equipment at 
a crossbar tandem office results in efficient use of such equipment and 
over-all lower cost as compared with furnishing this equipment at each 
local office requiring it. Examples of such equipment are remote control 
of zone registration and centralized automatic message accounting.? Cen- 
tralization of other services such as weather bureau, time-of-day and 
similar services can be furnished. 

The first crossbar tandem offices were installed in 1941 in New York, 
Detroit and San Francisco. These offices were equipped to interconnect 
local panel and No. 1 crossbar central offices in the metropolitan areas, 
and to complete calls to manual central offices in the same areas. The war 

years slowed both development and production and it was not until the 

late 40’s that many features now in use were placed in service. These 
later features enable customers in step-by-step local central offices on the 
fringes of the metropolitan areas to interconnect on a direct dialing basis 
with metropolitan area customers in panel, crossbar, manual and step- 
by-step central offices. This same development also permitted central 
offices in strictly step-by-step areas to be interconnected by a crossbar 
tandem office where direct interconnecting was not economical. Facilities 
were also made available in the crossbar tandem system for completing 
calls from switchboards where operators use dials or multifrequency key 
pulsing sets. 

Since a crossbar tandem office usually has access to all of the local 
offices in the area in which it is installed, it is attractive for handling 
short and long haul terminating traffic. The addition of toll terminal 
equipment at Gotham Tandem in New York City in 1947 permitted 
operators in New York State and northern New Jersey as well as distant 
operators to dial or key pulse directly into the tandem equipment for 
completion of calls to approximately 350 central offices in the New York 
metropolitan area. This method of completing these calls without the 
aid of the inward operators was a major advance in using tandem switch- 
ing equipment for speeding completion of out-of-town calls. 


CROSSBAR TANDEM AS A TOLL SWITCHING SYSTEM 93 


CROSSBAR TANDEM SWITCHING ARRANGEMENT 


The connections in a crossbar tandem office are established through 
crossbar switches mounted on incoming trunk link and outgoing office 
link frames shown on Fig. 1. The connections set up through these 
switches are controlled by equipment common to the crossbar tandem 
office which is held only long enough to set up each individual connec- 
tion. Senders and markers are the major common control circuits. 

The sender’s function is to register the digits of the called number, 
transmit the called office code to the marker and then, as subsequently 
directed by the marker, control the outpulsing to the next office. 

The marker’s function is to receive the code digits from the sender 
for translation, return information to the sender concerning the de- 
tails of the call, select an idle outgoing trunk to the called destination 
and close the transmission path through the crossbar switches from the 
incoming to the outgoing trunk. 


GENERAL ASPECTS OF NATIONWIDE DIALING 


Operator distance dialing, now used extensively throughout the 
country, as well as customer direct distance dialing are based on the 
division of the United States and Canada into numbering plan areas, 
interconnected by a national network through some 225 Control Switch- 
ing Points (CSP’s) equipped with automatic toll switching systems. 

An essential element of the nationwide dialing program is a universal 
numbering plan* wherein each customer will have a distinctive number 
which does not conflict with the number of any other customer. The 
method employed is to divide the United States and Canada geographi- 
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Fig. 1 — Crossbar tandem switching arrangement. 
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cally into more than 100 numbering plan areas and to give each of these 
a distinctive three digit code with either a 1 or 0 as the middle digit. 
Each numbering plan area will contain 500 or fewer local central offices 
each of which will be assigned a distinctive three-digit office code. 
Thus each of the telephones in the United States and Canada will have, 
for distance dialing purposes, a distinct identity consisting of a three 
digit area code, an office code of two letters and a numeral, and a sta- 
tion number of four digits. Under this plan, a customer will dial 7 digits 
to reach another customer in the same numbering area and 10 digits to 
reach a customer in a different numbering area, 

A further requirement for nationwide dialing of long distance calls is 
a fundamental plan‘ for automatic toll switching. The plan provides a 
systematic method of interconnecting all the local central offices and 
toll switching centers in the United States and Canada. As shown on 
Fig. 2, several local central offices or ‘‘end offices” are served by a single 
toll center or toll point that has trunks to a “home”’ primary center 
which serves a group of toll centers. Each primary center, has trunks to 
a “home” sectional center which serves a larger area of the country. 
Similarly, the entire toll dialing territory is divided into eleven very 
large areas called regions, each having a regional center to serve all the 
sectional centers in the region. One of the regional centers, probably 
St. Louis, Missouri, will be designated the national center. The homing 
arrangements are such that it is not necessary for end offices, toll centers, 
toll points and primary centers to home on the next higher ranking 
office since the complete final route chain is not necessary. For example, 
end offices may be served directly from any of the higher ranking switch- 
ing centers also shown in Fig. 2. 

Collectively, the national center, the regional centers, the sectional 
centers and the primary centers will constitute the control switching 
points for nationwide dialing. The basic switching centers and homing 
arrangements are illustrated in Fig. 38. 


TANDEM CROSSBAR FEATURES FOR NATIONWIDE DIALING’ 


The broad objective in developing new features for crossbar tandem 
is to provide a toll switching system that can be used in cities where 
the large capacity and the full versatility of the No. 4 toll crossbar 
switching system® may not be economical. 

The application of crossbar tandem two-wire switching systems at 
primary and sectional centers has been made possible by the extended 
use of high speed carrier systems. The echoes at the 2-wire crossbar 
tandem switching offices can be effectively reduced by providing a high 
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office balance and by the use of impedance compensators and fixed pads. 
A well balanced two-wire switching system, proper assignment of inter- 
toll trunk losses, and the use of carrier circuits with high speed of propa- 
gation will permit through switching with little or no impairment from 
an echo standpoint. 

The new features for crossbar tandem will provide arrangements 
necessary for operation at control switching points (CSP’s). These in- 
clude automatic alternate routing, the ability to store and send forward 
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digits as required, highly flexible code conversion (transmitting forward 
different digits for the area or office code instead of the dialed digits), 
prefixing digits ahead of the called office code, and six-digit translation. 


ALTERNATE ROUTING 


The control switching points will be interconnected by a final or 
“backbone” network of intertoll trunks engineered so that very few 
calls will be delayed. In addition, direct circuits between individual 
switching offices of all classes will be provided as warranted by the 
traffic density. These are called “high-usage”’ groups and are not en- 
gineered to handle all the traffic offered to them during the busy hour. 
Traffic offered to a high-usage group which finds all trunks busy will be 
automatically rerouted to alternate routes®:7 consisting of other high- 
usage groups or to the final trunk group. The ability of the crossbar 
tandem equipment at the control switching point to select one of several 
alternate routes automatically, when all choices in the first route are 
busy, contributes to the economy of the plant and provides additional 
protection against complete interruption of service when all circuits on 
a particular route are out of service. 

Fig. 4 shows a hypothetical example of alternate routing when a 
crossbar tandem office at South Bend, Indiana, receives a call destined 
for Youngstown, Ohio. To select an idle path, using this plan, the 
switching equipment at South Bend first tests the direct trunks to 
Youngstown. If these are all busy, it tests the direct trunks to Cleveland 
where the call would be completed over the final group to Youngstown. 
If the group to Cleveland is also busy, South Bend would test the group 
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Fig. 4 — Toll network — alternate routing. 
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to Pittsburgh and on its Jast attempt it would test the final group to 
Indianapolis. If the call were routed to Pittsburgh or Indianapolis, the 
switching equipment at these points would attempt by first choice and 
alternate routes to reach Youngstown. The final choice backbone route 
would be via Indianapolis, Chicago, St. Louis, Pittsburgh, Cleveland to 
Youngstown. Should all the trunks in any of the final groups tested be 
busy no further attempt to complete the call is made. It is unlikely 
that so many alternate routes would be provided in actual practice 
since crossbar tandem can test only a maximum of 240 trunks on each 
call and, in the case illustrated, the final trunk group to Indianapolis 
may be quite large. 

The method employed by the crossbar tandem marker in selecting 
the direct route and subsequent alternate routes is shown in simplified 
form on Fig. 5. As a result of the translating operation, the marker 
selects the first choice route relay, corresponding to the called destina- 
tion. Each route relay has a number of contacts which are connected to 
supply all the information required for proper routing of the call. Several 
of these contacts are used to indicate the equipment location of the 
trunks and the number of trunks to be tested. The marker tests all of 
the trunks in the direct route and if they are busy, the search for an 
idle trunk continues in the first alternate route which is brought into 
play from the “route advance” cross-connection shown on the sketch. 
As many as three alternate routes in addition to the first choice route 
can be tested in this manner. 


STORING AND SENDING FORWARD DIGITS AS REQUIRED 


The crossbar tandem equipment at control switching points must 
store all the digits received and send forward as many as are required to 
complete the call. 

The called number recorded at a switching point is in the form of 
ABX-XXXX if the call is to be completed in the same numbering 
plan area. If the called destination is in another area, the area code 
XOX or XIX precedes the 7 digit number. The area codes XOX or XIX 
and the local office code ABX are the digits used for routing purposes 
and are sufficient to complete the call regardless of the number of switch- 
ing points involved. Each control switching point is arranged to ad- 
vance the call towards its destination when these codes are received. 
If the next switching point is not in the numbering area of the called 
telephone, the complete ten-digit number is needed to advance the 
cal] toward its destination. If the next switching point is in the num- 
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bering area of the called telephone the area code is not needed and seven 
digits will suffice for completing the call. 

For example, suppose a call is originated by a customer in South 
Bend, Indiana, destined for customer NAtional 4-1234 in Washington, 
D.C. If it is assumed that the route to Washington is via a switching 
center in Pittsburgh, then the crossbar tandem equipment at South 
Bend pulses forward to Pittsburgh 202-NA4-1234, 202 being the area 
code for the District of Columbia. Pittsburgh in turn will delete the 
area code and send NA4-1234 to the District of Columbia terminating 
area. 

As another example, suppose the crossbar tandem office at South 
Bend receives a call from some foreign area destined to a nearby step- 
by-step end office in Michigan. The crossbar tandem equipment re- 
ceives and stores a ten-digit number comprising the area code and the 
seven digits for the office code and station number. Assuming that 
direct trunks to the step-by-step end office in Michigan are available, 
the area code and office code are deleted and the line number only is 
pulsed forward. To meet all conditions, the equipment is arranged to 
permit deletion of either the first three, four, five or six digits of a ten- 
digit number. 


CODE CONVERSION 


At the present time, some step-by-step primary centers reach other 
offices by the use of routing codes that are different from those assigned 
under the national numbering plan. This arrangement is used to obtain 
economies in switching equipment of the step-by-step plant and is 
accetpable with operator originated calls. However, with the intro- 
duction of customer direct distance dialing, it is essential that the codes 
used by customers be in accordance with the national numbering plan. 
The crossbar tandem control switching point must then automatically 
provide the routing codes needed by the intermediate step-by-step 
primary centers. This is accomplished by the code conversion feature 
which substitutes the arbitrary digits required to reach the called office 
through the step-by-step systems. Fig. 6 illustrates an application of 
this feature. It shows a crossbar tandem office arranged for completing 
calls through a step-by-step toll center to a local central office, GArden 
8, in an adjacent area. A call reaching the crossbar tandem office for a 
customer in this office arrives with the national number, 218-GA8-1234. 
To complete this call, the crossbar tandem equipment deletes the area 
code 218 and pulses forward the local office code and number. If the 
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call is switched to an alternate route via the step-by-step primary 
center, it will be necessary for the crossbar tandem equipment to delete 
the area code 218 and substitute the arbitrary digits 062 to direct the 
call through the switches at the primary center, since the toll center 
requires the full seven digit number for completing the call. 


PREFIXING DIGITS 


It may be necessary to route a call from one area to another and back 
to the original area for completion. Such a situation arises on a call 
from Amarillo to Lubbock, Texas, both in area 915 when the crossbar 
tandem switching equipment finds all of the direct paths from Amarillo 
to Lubbock busy as illustrated on Fig. 7. The call could be routed to 
Lubbock via Oklahoma City which is in area 405. A seven-digit number 
. for example, MAin 2-1234, is received in the crossbar tandem office at 
Amarillo. Assuming that the call is to be switched out of the 915 area 
through the 405 area and back to the 915 area for completion, it is 
necessary for the crossbar tandem office in Amarillo to prefix 915 to the 
MAin 2-1234 number so that the switching equipment in Oklahoma 
City will know that the call is for the 915 area and not for the 405 area. 

Prefixing digits may also be needed at crossbar tandem offices to 
route calls through step-by-step primary centers. The crossbar tandem 
office in Fig. 8 receives the seven digit number MA2-12384 for a call to a 
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customer in the Madison office in the same area. However, since the 
toll center needs the full seven digit number for completing the call and 
since the step-by-step switches at the primary center “use up” two 
digits (04) for its switching, the crossbar tandem equipment must 
prefix 04 to the seven digit number. 


METHOD OF DETERMINING DIGITS TO BE TRANSMITTED 


The circuitry involved for transmitting digits as received, prefixing, 
code conversion and for deletion involves both marker and sender 
functions. The senders have ten registers (1 to 10) for storing incoming 
digits and three registers (AA, AB, AC) for storing the arbitrary digits 
that are used for prefixing and code conversion. 

On a ten-digit call into a crossbar tandem switching center the area 
code XOX, the office code ABX and the station number XXXX are 
stored in the inpulsing or receiving registers of the sender. The code 
digits XOX-ABX are sent to the marker which translates them to 
determine which of the digits received by the sender should be outpulsed. 
It also determines whether arbitrary digits should be transmitted ahead 
of the digits received and, if so, the value of the arbitrary digits to be 
stored in the sender registers AA, AB and AC. Case 1 of Fig. 9 assumes 
that a ten-digit number has been stored in the sender registers 1 to 10 
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and that the marker has informed the sender the called number is to be 
sent as received. The outpulsing control circuit is connected to each 
register in turn through the steering circuit S1, $2, etc. and sends the 
digits stored. 

Case 2 illustrates a situation where the sender has stored ten digits 
in registers 1 to 10 and received information from the marker to delete 
the digits in registers 1 to 3 inclusive and to substitute the arbitrary 
digits stored in registers AA, AB and AC. The outpulsing circuit is 
first connected to register AA through steering circuit PS1, then to AB 
through P82, continuing in a left to right sequence until all digits are 
outpulsed. 

Case 3 covers a condition where the sender has stored seven digits and 
has obtained information from the marker to prefix the two digits 
stored in registers AB and AC. Outpulsing begins at the AB register 
through steering circuit PS2 and then advances through steering circuit 
PS3 to the AC register, continuing in a left to right sequence until all 
digits have been transmitted. 

These are only a few of the many combinations that are used to give 
the crossbar tandem control switching equipment complete pulsing 
flexibility. 


SIX-DIGIT TRANSLATION 


Six-digit translation will be another feature added to the crossbar 
tandem system. When only three digits are translated, it is necessary to 
direct all calls to a foreign area over a single route. The ability to trans- 
late six digits permits the establishment of two or more routes from the 
switching center to or towards the foreign area. This is shown in Fig. 
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10 with Madison and Milwaukee, Wisconsin, in area 414 and Belle 
Plaine Crossbar Tandem in Chicago, Illinois, in area 312. An economical 
trunking plan may provide for direct circuits from Chicago to each 
place. If only three-digit translation were provided in the Chicago 
switching equipment, the route to both places would be selected as a 
result of the translation of the 414 area code alone and, therefore, calls 
to central offices reached through Madison, would need to be routed 
via Milwaukee. This involves not only the extra trunk mileage, but 
also the use of an extra switching point. With six-digit translation, both 
the area code and the central office code are analyzed, making it 
possible to select the direct route to either city. 

Six-digit translation in crossbar tandem will involve primarily the 
use of a foreign area translator and a marker. The translator will have 
a capacity for translation of five foreign areas and for 60 routes to each 
area. Since the translator holding time is very short, one translator is 
sufficient to handle all of the calls requiring six-digit translation, but 
two are always provided for hazard and maintenance reasons. 

On a call requiring six-digit translation the first three digits are 
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translated in the marker and the second three digits in a foreign area 
translator which is associated with the marker. Fig. 11 shows, in simpli- 
fied form, how this translation is accomplished. 

The first three digits, corresponding to the area code, are received by 
a relay code tree in the marker which translates it into one of a thousand 
code points. This ecde point is cross-connected to the particular relay of 
the five area relays AO-A4 which has been assigned to the called area. 
A foreign area translator is now connected to the marker and a corre- 
sponding area relay is operated in it. The translator also receives the 
called office code from the sender via the marker and by means of a 
relay code tree similar to that in the marker translates the office code 
to one of a thousand code points. This code point plus the area relay is 
sufficient to determine the actual route to be used. As shown on the 
sketch, wires from each of the code points are threaded through trans- 
formers, two for each area. When the marker is ready to receive the 
route information, a surge of current is sent through one of these threaded 
wires which produces a voltage in the output winding to ionize the 
T- and U- tubes. Only the tubes associated with the area involved in 
the translation pass current to operate one each of the eight T- and U- 
relays. This information is passed to the marker and registered on 
correspondingetens and units relays. These operate a route relay which 
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provides all the information necessary for routing the call to the central] 
office involved. 


CUSTOMER DIRECT DISTANCE DIALING 


Crossbar tandem will provide arrangements permitting customers in 
step-by-step offices to dial their own calls anywhere in the country. 
Centralized automatic message accounting previously mentioned will 
be used for charging purposes. While the basic plan for direct distance 
dialing provides for the dialing of either seven or ten digits, it will be 
necessary for the customer in step-by-step areas to prefix a three-digit 
directing code, such as 112, to the called number. This directing code 
is required to direct the call through the step-by-step switches to the 
crossbar tandem office so that the seven or ten digit number can be 
registered in the crossbar tandem office. 

When a customer in a step-by-step office originates a call to a distant 
customer whose national number is 915-CH3-1234, he first dials the 
directing code 112 and then the ten-digit number. The dialing of 112 
causes the selectors in the step-by-step office to select an outgoing trunk 
to the crossbar tandem office. The incoming trunk in the crossbar tandem 
office has quick access to a three-digit register. The register must be 
connected during the interval between the last digit of the directing 
code and the first digit of the national number to insure registration of 
this number. This arrangement is used to permit the customer to dial 
all digits without delay and avoids the use of a second dial tone. If this 
arrangement were not used, the customer would be required to wait 
after dialing the 112 until the trunk in the tandem crossbar office could 
gain access to a sender through the sender link circuit which would 
then signal the customer to resume dialing by returning dial tone. 

After recording the 915 area code digits in the case assumed, the 
CH3-1234 portion of the number is registered directly in the tandem 
sender which has been connected to the trunk while the customer was 
dialing 915. When the sender is attached to the trunk, it signals the 
three-digit register to transfer the 915 area code digits to it via a con- 
nector circuit. Thus when dialing is complete, the entire number 915- 
CH8-1234 is registered in the sender. 

Crossbar tandem is being arranged to serve customers of panel and 
No. 1 crossbar offices for direct distance dialing. At the present time, 
ten digit direct distance dialing is not available to these customers 
because the digit storing equipments in these offices are limited to 
eight digits. Developments now under way, will provide arrangements 
for expanding the digit capacity in the local offices so that ultimately 
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calls from customers in panel and No. 1 crossbar offices may be routed 
through crossbar tandem cr other equivalent offices to telephones 
anywhere in the country. 


CONCLUSION 


The new features developed for crossbar tandem will adapt it to 
switching all types of traffic at many important switching centers of 
the nationwide toll network. Of the 225 important toll switching centers 
now contemplated, it is expected that about 80 of these will be equipped 
with crossbar tandem. 
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Growing Waves Due to Transverse 
Velocities 


By J. R. PIERCE and L. R. WALKER 
(Manuscript received March 30, 1955) 


This paper treats propagation of slow waves in two-dimensional neu- 
tralized electron flow in which all electrons have the same velocity in the 
direction of propagation but in which there are streams of two or more veloct- 
ties normal to the direction of propagation. In a finite beam in which 
electrons are reflected elastically at the boundaries and in which equal dc 
currents are carried by electrons with transverse velocities -+u, and —uUW, 
there is an antisymmetrical growing wave if 


Wp ~ (rin /W)” 
and a symmetrical growing wave if 
wp ~ $8 (mu/W) 


Here w, is plasma frequency for the total charge density and W 1s beam 
width. 


INTRODUCTION 


It is well-known that there can be growing waves in electron flow when 
the flow is composed of several streams of electrons having different 
velocities in the direction of propagation of the waves.’ While Birdsall’ 
considers the case of growing waves in electron flow consisting of streams 
which cross one another, the growing waves which he finds apparently 
occur when two streams have different components of velocity in the 
direction of propagation. 

This paper shows that there can be growing waves in electron flow 
consisting of two or more streams with the same component of velocity 
in the direction of wave propagation but with different components of 
velocity transverse to the direction of propagation. Such growing waves 
can exist when the electric field varies in strength across the flow. Such 
waves could result in the amplification of noise fluctuations in electron 
flow. They could also be used to amplify signals. 
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Actual electron flow as it occurs in practical tubes can exhibit trans- 
verse velocities. For instance, in Brillouin flow, ”: ° if we consider electron 
motion in a coordinate system rotating with the Larmor frequency we 
see that electrons with transverse velocities are free to cross the beam 
repeatedly, being reflected at the boundaries of the beam. The trans- 
verse velocities may be completely disorganized thermal velocities, or 
they may be larger and better-organized velocities due to aberrations at 
the edges of the cathode or at lenses or apertures. Two-dimensional 
Brillouin flow allows similar transverse motions. 

It would be difficult to treat the case of Brillouin or Brillouin-like flow 
with transverse velocities. Here, simpler cases with transverse velocities 
will be considered. The first case treated is that of infinite ion-neutra- 
lized two-dimensional flow with transverse velocities. The second case 
treated is that of two-dimensional flow in a beam of finite width in which 
the electrons are elastically reflected at the boundaries of the beam. 
Growing waves are found in both cases, and the rate of growth may be 
large. 

In the case of the finite beam both an antisymmetric mode and a 
symmetric mode are possible. Here, it appears, the current density 
required for a growing wave in the symmetric mode is about 44 times 
as great as the current density required for a growing wave in the anti- 
symmetric mode. Hence, as the current is increased, the first growing 
waves to arise might be antisymmetric modes, which could couple to a 
symmetrical resonator or helix only through a lack of symmetry or 
through high-level effects. 


1. Infinite two-dimensional flow 


Consider a two-dimensional problem in which the potential varies 
sinusoidally in the y direction, as exp(—76z) in the z direction and as exp 
(jwt) with time. Let there be two electron streams, each of a negative 
charge pp) and each moving with the velocity wo in the z direction, but 
with velocities u; and — wu: respectively in the y direction. Let us denote 
ac quantities pertaining to the first stream by subscripts 1 and ac quan- 
tities pertaining to the second stream by subscripts 2. The ac charge 
density will be denoted by p, the ac velocity in the y direction by 4, 
and the ac velocity in the z direction by 2. We will use linearized or 
small-signal equations of motion.’ We will denote differentiation with 
respect to y by the operator D. 

The equation of continuity gives 


jop, = —D(prts + pot) + JB(pivo + poés) (1.1) 
jop, = —D(—p2ti + potj2) + jB(pxto + prez) (1.2) 
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Let us define 
di = j(w — Buo) + mD 
dy, = j(w — Bu) — wD 
We can then rewrite (1.1) and (1.2) as 
dip: = po(—Diy + 7821) 
dep2 = po(—Dy2 + 7822) 
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(1.3) 
(1.4) 


(1.5) 
(1.6) 


We will assume that we are dealing with slow waves and can use a po- 
tential V to describe the field. We can thus write the linearized equations 


of motion in the form 


I 


ditt = —7 2 BV 
m 

dé. = —j— BV 
Mm 


Dy 
m 


diy 
wee 
do = - DV 
From (1.5) to (1.10) we obtain 
dip = -< po(D* — p°)V 
ds'p2 = —< p(D? — B°)V 
Now, Poisson’s equation is 
D’ —= 2 V = __Pl + p2 
( 8) = 
From (1.11) to (1.13) we obtain 


(Dt - BW = — Yo (1 +4) 0 - 8 
1 2 


e 
LD ay 

2 m 
Op = 





E 


Here w, is the plasma frequency for the charge of both beams. 


(1.7) 
(1.8) 
(1.9) 


(1.10) 


(1.11) 


(1.12) 
(1.13) 


(1.14) 


(1.15) 
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Either 
(D* — B’)V = 0 (1.16) 
or else 


tae (de ds) 


he eo age 


(1.17) 





We will consider this second ease. 
We should note from (1.3) and (1.4) that 


dy = uyD® — (w — Buo)? + 27D(w — Bu) (1.18) 
dy = uD? — (w — Bu)” — 27D — Buo)t (1.19) 
dy + dz = 2[uD* — (w — Bu)’ (1.20) 
dyde = [uD + (w — Bu)! (1.21) 
Thus, (1.17) becomes 
—wre lu D> — (w — Bu)’ 
[wD + (w — Buo)’? 
If the quantities involved vary sinusoidally with y as cos yy or sin yy, 
then 


1= (1.22) 


D? = —7 (1.23) 


: E (: ri eu) | 
t=“ aa (1.24) 
yur k = (: = =) 
yu 


What happens if we have many transverse velocities? If we refer back 
to (1.14) we see that we will have an equation of the form 


din +d -) 
— fo 1 2 ln 2n 
1 = 2) — Yoon (4g 5 ae (1.25) 


Our equation becomes 





Here wyn is a plasma frequency based on the density of electrons having 
transverse velocities +u, . Equation (1.25) can be written 


1=»>, pn E z : a) (1.26) 


yt [2 _ (= al 


Oa) a 
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(W- BU) —> 


Fig. 1 


Suppose we plot the left-hand and the right-hand sides of (1.26) versus 
(w — Buo). The general appearance of the left-hand and right-hand sides 
of (1.26) is indicated in Fig. 1 for the case of two velocities u, . There 
will always be two unattenuated waves at values of (wo — Buo)” > y'ue™ 
where u, is the extreme value of u,; these correspond to intersections 3 
and 3’ in Fig. 2. The other waves, two per value of u,, may be unat- 
tenuated or a pair of increasing and decreasing waves, depending on the 
values of the parameters. If 

2 


yo eS i 


Ur? 





there will be at least one pair of increasing and decreasing waves. 

It is not clear what will happen for a Maxwellian distribution of veloci- 
ties. However, we must remember that various aberrations might give a 
very different, strongly peaked velocity distribution. 

Let us consider the amount of gain in the case of one pair of transverse 
velocities, +; . The equation is now 


_— E +(2 _ i 
Y U1 ae yu (1.27) 





Wy? re (: a3 buoy 2 
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= o - yYuie 
B a) ; 


Let 





(1.28) 
U 
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Fig. 2 


This relation defines e. Equation (1.27) becomes 


yur _ 1- é (1 29) 
In Fig. 2, € is plotted versus the parameter y'u1’/w,. We see that as the » 
parameter falls below unity, e increases, at first rapidly, and then more 
slowly, reaching a value of -k1 as the parameter goes to zero (as w,” 
goes to infinity, for instance). 

It will be shown in Section 2 of this paper that these results for infinite 
flow are in some degree an approximation to the results for flow in narrow 
beams. It is therefore of interest to see what results they yield if applied 
to a beam of finite width. 

If the beam has a length L, the voltage gain is 


ef huiluo) L (1.30) 





The gain G in db is © 





G = 87 vast edb (1.31) 
0 
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Let the width of the beam be W. We let 


nw 
= 1.32 
Y= > (1.32) 
Thus, for » = 1, there is a half-cycle variation across the beam. From 
(1.31) and (1.32) 

G@ = 273 (2! r) ne db (1.33) 

Uo d 

Now L/uo is the time it takes the electrons to go from one end of the 
beam to the other, while W/w is the time it takes the electrons to cross 
the beam. If the electrons cross the beam N times 


U1 L 
= 1.34 
N a W (1.84) 
Thus, 
G = 27.3 Nne db (1.35) 


While for a given value of ¢ the gain is higher if we make the phase 
vary many times across the beam, i.e., if we make n large, we should | 
note that to get any gain at all we must have 


2 22 
Wp > YU 


2 (1.36) 
‘ie > (me 





W 


If we increase w,', which is proportional to current density, so that w,” 
passes through this value, the gain will rise sharply just after w,’ passes 
through this value and will rise less rapidly thereafter. 


2. A Two-Dimensional Beam of Finite Width. 


Let us assume a beam of finite width in the y-direction; the boundaries 
lying at y = -eyo. It will be assumed also that electrons incident upon 
these boundaries are elastically reflected, so that electrons of the incident 
stream (1 or 2) are converted into those of the other stream (2 or 1). The 
condition of elastic reflection implies that 


y= —Fe (2.1) 

4 = 2 aty = ty (2.2) 
and, in addition, that 

pi =p, aby = +Yo (2.3) 


since there is no change in the number of electrons at the boundary. 
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The equations of motion and of continuity (1.7—1.12) may be satisfied 
by introducing a single quantity, y, such that 


V=ed¢ day (2.4) 
a= —j as B di de (2.5) 
m 
f= —j- de dy (2.6) 
m 
=~ diddy (2.7) 
m 
Y= = dy’ d2aDy (2.8) 
m 
pi = oat p(D* — 6’) dep (2.9) 
m 
po = —< po(D® — *) dey (2.10) 


Then, if we introduce the symbol, 2, for # — Buo 


Hith= 25 = d; doDOY (2.11) 
aes Bee ee 25 < di deuDy (2.12) 
pi — po = 25 < po(D? — 6 )mQDy (2.18) 
It is clear that if 
Dy =DW=0 y= +y (2.14) 


the conditions for elastic reflection will be satisfied. The equation satis- 
fied by y may now be found from Poisson’s equation, (1-18), and is 


(D* — °) dé dey. = a (D? — B’)(d? + dey 
or 


(D? — B’)[(u"D* + 2°)’ + w,'(uD? — 2’)] = 0 (2.15) 


which is of the sixth degree in D. So far four boundary conditions have 
been imposed. The remaining necessary pair arise from matching the 
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internal fields to the external ones. For y > yo 


V = Ve. - (2.16) 
and | 
aV 
a +BV=0 aty = yo 
Similarly 
* — BV =0 at y = —Yo (2.17) 


The most familiar procedure now would be to look for solutions of 
(2.15) of the form, e”. This would give the sextic for c 


(c? — B)[(ure + 2°) + w, (ure? — 9)] = 0 (2.18) 


with the rootsc = +, +c, +c: , let us say. We could then express y 
as a linear combination of these six solutions and adjust the coefficients 
to satisfy the six boundary equations. In this way a characteristic equa- 
tion for 8 would be obtained. From the symmetry of the problem this 
has the general form F'(8, c:) = F(@, c2), where c; and ce are found from 
(2.18). The discussion of the problem in these terms is rather laborious 
and, if we are concerned mainly with examining qualitatively the onset 
of increasing waves, another approach serves better. 

From the symmetry of the equations and of the boundary conditions 
we see that there are solutions for y (and consequently for V and p) 
which are even in y and again some which are odd in y. Consider first the 
even solutions. We will assume that there is an even function, yi(y), 
periodic in y with period 2y), which coincides with y(y) in the open 
interval, —yo < y < yo and that y(y) has a Fourier cosine series repre- 
sentation: 
wily) = pp Cn COSAnY An = nT on =0, 1,2,--- (219) 


Yo 


y inside the interval satisfies (2.15), so we assume that yi(y) obeys 


(D? — B°)[(w"D? + 9°)? + wp (uD? — 2) 
| 2 (2.20) 
= >) y — 2m + 1yo) 


M=—co 


where 6 is the familiar 6-function. Since Dy and D'y are required to vanish 
at the ends of the interval and y, D’y and Dy are even it follows that all 
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of these functions are continuous. We assume that yi = ¥, Dj = Dy, 
Dy, = Db, Di = DW and Di, = Dy at the ends of the intervals. 
From (2.20), u‘D—- —% as y > yo. 

Since 


+20 ee 1 1 re} 
> iy — 2m + Iyo) = — + — D> (—1)" cos Any (2.21) 
oo 2Yo Yo 1 


we obtain from (2.20) 


1 
2yu. = -( sexe a 
(2 GQ? os 2 
pe Tee (2,22) 
ayn COS AnY ) 
aa 2 dX ( 1) (B? + An?) [(Q? = UyAn?)” == a ?(Q? + Urn")| 
Since 
aV 27n2 2\2 
ay BV = (D+ B)(wD + @/y, 
using (2.4), the condition for matching to the external field, 
aV 
yields, using Dy = D'y = 0 and u‘D'y = — 1, the relation 
(m'D’ + W)n = YB aty = yo. 
Applying this to (2.22), we then obtain, finally, 
Us eet 
2 ? ae 2 
B B*| wy" (2,23) 
4 9 > (Q? -_ di Ne 


i (8? + An) [(Q? =< Urn)? es wp?(Q? + Urr?)| 


For the odd solution we use a function, y2(y), equal to y(y) in —y < 
y < yo and representable by a sine series. To ensure the vanishing of Dy 
and D’y at y = -yo it is appropriate to use the functions, sin u»y, where 
Bn = (n + 16)r/yo. The period is now 4y and we define yo(y) in yo < 
y < 3yo by the relation yo(y) = Y(2y. — y) and in —3y < y < —y by 
Voy) = ¥(—2y — y). Thus, we write 


yoy) = > dn Sin unYy en = (n + Y)rr/yo 


¥2(y) will be supposed to satisfy 
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(D? — 6 )[(urD? + 9) + we) (urD? — 1) [pr 
te Bet eee, ee oe (2.24) 
= do [ay — 4m + Lye) — 8(y — 4m — Ty) 


The extended definition of 2 (outside — yo < y < yo) is such that we may 
again take yy = y,----, Dy, = D‘yat the ends of the interval. ‘Dy, is 
still equal to — 4% at y = yw. Now 


+00 
a [iy — 4m + ly) — hy — 4m — Iyo)] 
7 (2.25) 


1 : 
a —1)”" sin pn 
7 » (-1) Hn 
so from (2.24) we may find 


= (—1)" sin pry 
= —2)) oS OOOoeeoooT™-_—C—‘=*"#ED'. 2 
Whe = — 2 TE — ade — oA + ae 9) 
Matching to the external field as before gives 
(u2D? + O)Y,=25 aby =y 
28 
and applied to (2.26) we have 
_ Yo < (9? = Ur bn) (2 27) 
= 0B:<° 0 (6? + Br? )[(Q? = Uren)? = wp'(O? ae Urpin?) ] 
The equations (2.23) and (2.27) for the even and odd modes may be 
rewritten using the following reduced variables. 


Byo 


TT 
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WYo Us 
pe CU 
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Tuy? 
(2.23) becomes 
= (n? — k’)’ 
poet? Lape @oe awe 7 28) 
and (2.27) transforms to 
0 2 1 2 _ 2272 
2 > - - [a+ %) — Fk] 





e+ a + 4) [a +P — PP Am + BP + (2.29) 


= —1z 
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We shall assume in considering (2.28) and (2.29) that the beam is 
sufficiently wide for the transit of an electron from one side to the other 
to take a few RF cycles. The number of cycles is in fact, wiyo/mu , and, 
hence, from the definition of z, we see that for values of k less than 2, 
perhaps, 2 is certainly positive. 

Let us consider (2.29) first since it proves to be the simpler case. If we 
transfer the term zz to the right hand side, it follows from the observa- 
tion that z is positive (for modest values of k), that it is necessary to 
make the sum negative. The sum may be studied qualitatively by sketch- 
ing in the kh? — & plane the lines on which the individual terms go to 
infinity, given by 


2 _ [n+ %)* — kT} 
8 re (2.30) 
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Fig. 3 shows a few such curves (n = 0, 1, 2). To the right of such curves 
the individual term in question is negative, except on the line, kh? = 
(n + 1%)’, where it attains the value of zero. Approaching the curves 
from the right the terms go to — ~. On the left of the curves the func- 
tion is positive and goes to + © as the curve is approached from the 
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left. Clearly in the regions marked + which lie to the left of every curve 
given by (2.30), the sum is positive and we cannot have roots. Let us 
examine the sum in the region to the right of the n = 0 curve and to the 
left of all others. On the line, k’ = 14, the sum is positive, since the first 
term is zero. On any other line, k> = constant, the sum goes from + © 
at the n = 1 curve monotonically to — © at the n = 0 curve, so that 
somewhere it must pass through 0. This enables us to draw the zero- 
sum contours qualitatively in this region and they are indicated in Fig. 3. 
We are now in a position to follow the variation in the sum as k varies 
at fixed 8°. It is readily seen that for 6° < 0.25, because —7z is negative 
in the region under consideration, there will be four real roots, two for 
positive, two for negative k. For 5 slightly greater than 0.25, the sum has 
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a deep minimum for k = 0, so that there are still four real roots unless z 
is very large. For z fixed, as 4° increases, the depth of the minimum de- 
creases and there will finally occur a 6’ for which the minimum is so shal- 
low that two of the real roots disappear. Call z(0) the value of z for k = 0, 
write the sum as (6, k’) and suppose that 2(6),.0) = —7z(0), then for 
small k we have 








oz 
2(8, #) = —x2(0) + (8 — a) 2 sa + Ph 0) 
dz 
2 Ufo, _ 08? (2 92 
k am k= aD (6 — 6) 
ok? ok? 
a= 
_ U1/Uo 05° (5 luo\" 
ea a oe Oda 
ok? ok? 2 


The roots become complex when 


= 62 — (ur/u0)” 
4 28 
08? ok? 
Since w:/u may be considered small (say 10 per cent) it is sufficient to 


look for the values of 6)°. 
When i” = 0 we have 


2” z (n + %)° 


WT SPE + lo wo — 
_ 227 < 5 Zz 
~ae0¥ orig et wre) 


=o a x (6 tan m5 + z tanh zz) 
Fig. 4 shows the solution of this equation for various 2(0) or wyo/muo . 
Clearly the threshold 6 is rather insensitive to variations in wYo/7uo . 
Equation (2.28) may be examined by a similar method, but here some 
complications arise. Fig. 5 shows the infinity curves for n = 0, 1, 2, 3; 
the n = 0 term being of the form k’/k’ — 6’. The lowest critical region 
in 8’ is the neighborhood of the point k” = 6” = 14, which is the intersec- 
tion of the n = 0 and n = 1 lines. To obtain an idea of the behavior of 
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the left hand side (1.h.s.) of (2.28) in this area we first see how the point 
k? = 6 = 14 can be approached so that the l.h.s. remains finite. If we 
put kh? = %+-eando = % + ce and expand the first two dominant 
terms of (2.28), then adjust ¢ to keep the result finite as e — 0 we find 


des 132 —5 

432+ 1 
c varies from — 54 to 14 asz goes from0 to ~, changing sign at 2” = 54. 
Every curve for which the ].h.s. is constant makes quadratic contact with 
the line & — 144 = c(h’ — 14) atk’ = & = \. If we remember that 
the l.h.s. is positive for kX = 0,0 < & <1and fork’ = 1,0 <8 <1, 
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since there are no negative terms in the sum for these ranges and again 
that the l.h.s. must change sign between the n = 0 and n = 1 lines for 
any k’ in the range 0 < k’ < 1 (since it varies from Fo to +), this 
information may be combined with that about the immediate vicinity 
of & = k’ = \ to enable us to draw a line on which the Lhs. is zero. 
This is indicated in Figs. 6A and 6B for small z and large z respec- 
tively. It will be seen that the zero curve and, in fact, all curves on which 
the ].h.s. is equal to a negative constant are required to have a vertical 
tangent at some point. This point may be above or below k° = ¥% (de- 
pending upon the sign of ¢ or the size of z) but always at a6’ > 14. For - 
5 < 14 there are no regions where roots can arise as we can readily see 
by considering how the l.h.s. varies with k’ at fixed 6°. For a fixed & > 14 
we have, then, either fork” > 4 or kh’ < 14, according to the size of z, 
a negative minimum which becomes indefinitely deep as 6° — 14. Thus, 
since the negative terms on the right-hand side are not sensitive to small 
changes in 6°, we must expect to find, for a fixed value of the l.h.s., two 
real solutions of (2.28) for some values of 5° and no real solutions for some 
larger value of 6’, since the negative minimum of the |.h.s. may be made 
as shallow as we like by increasing 6’. By continuity then we expect to 
find pairs of complex roots in this region. Rather oddly these roots, which 
will exist. certainly for 6’ sufficiently close to 144 + 0, will disappear if 
5° is sufficiently increased. 
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Coupled Helices 


By J.S. COOK, R. KOMPFNER and C. F. QUATE 
(Received September 21, 1955) 


An analysis of coupled helices is presented, using the transmission line 
approach and also the field approach, with the objective of providing the 
tube designer and the microwave circuit engineer with a basis for approxi- 
mate calculations. Devices based on the presence of only one mode of propa- 
gation are briefly described; and methods for establishing such a mode are 
given. Devices depending on the simultaneous presence of both modes, that 
is, depending on the beat wave phenomenon, are described; some experi- 
mental results are cited in support of the view that a novel and useful class of 
coupling elements has been discovered. 
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GLOSSARY OF SYMBOLS 


Mean radius of inner helix 

Mean radius of outer helix 

Capacitive coupling coefficient 

shunt susceptance of inner and outer helices, respectively 

Shunt susceptance plus mutual susceptance of inner and outer 
helices, respectively, Bio + Bm, Boo + Bm 

Mutual susceptance of two coupled helices 

Velocity of light in free space 

Radial separation between helices, b-a 

Directivity of helix coupler 

Electric field intensity 

Maximum fraction of power transferable from one coupled helix 
to the other 

Impedance parameter 

RF current in inner and outer helix, respectively 

Impedance in terms of longitudinal electric field on helix axis 
and axial power flow 

Minimum axial distance required for maximum energy transfer 
from one coupled helix to the other, \,/2 

Axial power flow along helix circuit 

Radial coordinate 

Radius where longitudinal component of electric field is zero for 
transverse mode (about midway between a and b) 

Return loss 

Radial separation between helix and adjacent conducting shield 
Time 

RF potential of inner and outer helices, respectively 

Inductive coupling coefficient 

Series reactance of inner and outer helices, respectively 

Series reactance plus mutual reactance of inner and outer helices, 
respectively, X10 + Xm, X20 + Xm 

Mutual reactance of two coupled helices 

Axial coordinate 

Impedance of inner and outer helix, respectively 

Attenuation constant of inner and outer helices, respectively 
General circuit phase constant; or mean circuit phase constant, 
V BiB2 

Free space phase constant 

Axial phase constant of inner and outer helices in absence of 


coupling, YW BX, V BoXe0 
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81,2 May be considered as axial phase constant of inner and outer 
helices, respectively 


Bp Beat phase constant 

Be Coupling phase constant, (identical with 8 when 6: = £2) 

Bee Coupling phase constant when there is dielectric material be- 
tween the helices 

Ba Difference phase constant, | 61 — Bs | 

Be Axial phase constant of single helix in presence of dielectric 

B:,2 Axial phase constant of transverse and longitudinal modes, re- 
spectively 

¥ Radial phase constant 

y:,¢ Radial phase constant of transverse and longitudinal modes, 
respectively 

r Axial propagation constant 


T;,2 Axial propagation constant for transverse and longitudinal 
coupled-helix modes, respectively 


€ Dielectric constant 

é Relative dielectric constant, €/é 

Ep Dielectric constant of free space 

r General circuit wavelength; or mean circuit wavelength, ~/Ai\2 
No Free space wavelength 

M1, 2 Axial wavelength on inner and outer helix, respectively 

Np Beat wavelength 

Ne Coupling wavelength (identical with \, when 6; = {e) 

y Helix pitch angle 


V1, 2 Pitch angle of inner and outer helix, respectively 
w Angular frequency 


1. INTRODUCTION 


Since their first appearance, traveling-wave tubes have changed only 
very little. In particular, if we divide the tube, somewhat arbitrarily, 
into circuit and beam, the most widely used circuit is still the helix, and 
the most widely used transition from the circuits outside the tube to the 
circuit inside is from waveguide to a short stub or antenna which, in 
turn, is attached to the helix, either directly or through a few turns of 
increased pitch. Feedback of signal energy along the helix is prevented 
by means of loss, either distributed along the whole helix or localized 
somewhere near the middle. The helix is most often supported along its 
whole length by glass or ceramic rods, which also serve to carry a con- 
ducting coating (‘‘aquadag’’), acting as the localized loss. 

We therefore find the following circuit elements within the tube en- 
velope, fixed and inaccessible once and for all after it has been sealed off: 
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1. The helix itself, determining the beam voltage for optimum beam- 
circuit interaction; 

2. The helix ends and matching stubs, etc., all of which have to be 
positioned very precisely with relation to the waveguide circuits in 
order to obtain a reproducible match; 

3. The loss, in the form of ‘“‘aquadag” on the support rods, which 
greatly influences the tube performance by its position and distribution. 

In spite of the enormous bandwidth over which the traveling-wave 
tube is potentially capable of operating — a feature new in the field of 
microwave amplifier tubes — it turns out that the positioning of the tube 
in the external circuits and the necessary matching adjustments are 
rather critical; moreover the overall bandwidths achieved are far short 
of the obtainable maximum. 

Another fact, experimentally observed and well-founded in theory, 
rounds off the situation: The electro-magnetic field surrounding a helix, 
i.e., the slow wave, under normal conditions, does not radiate, and is 
confined to the close vicinity of the helix, falling off in intensity nearly 
exponentially with distance from the helix. A typical traveling-wave 
tube, in which the helix is supported by ceramic rods, and the whole 
enclosed by the glass envelope, is thus practically inaccessible as far as 
RF fields are concerned, with the exception of the ends of the helix, 
where provision is made for matching to the outside circuits. Placing 
objects such as conductors, dielectrics or distributed loss close to the 
tube is, in general, observed to have no effect whatsoever. 

In the course of an experimental investigation into the propagation of 
space charge waves in electron beams it was desired to couple into a long 
helix at any point chosen along its length. Because of the feebleness of 
the RF fields outside the helix surrounded by the conventional sup- 
ports and the envelope, this seemed a rather difficult task. Nevertheless, 
if accomplished, such a coupling would have other and even more im- 
portant applications; and a good deal of thought was given to the 
problem. 

Coupled concentric helices were found to provide the solution to the 
problem of coupling into and out of a helix at any particular point, and to 
a number of other problems too. 

Concentric coupled helices have been considered by J. R. Pierce,” 
who has treated the problem mainly with transverse fields in mind. 
Such fields were thought to be useful in low-noise traveling-wave tube 
devices. Pierce’s analysis treats the helices as transmission lines coupled 
uniformly over their length by means of mutual distributed capacitance 
and inductance, Pierce also recognized that it is necessary to wind the 
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two helices in opposite directions in order to obtain well defined trans- 
verse and axial wave modes which are well separated in respect to their 
velocities of propagation. 

Pierce did not then give an estimate of the velocity separation which 
might be attainable with practical helices, nor did anybody (as far as we 
are aware) then know how strong a coupling one might obtain with such 
helices. 

It was, therefore, a considerable (and gratifying) surprise’’* to find 
that concentric helices of practically realizable dimensions and separa- 
tions are, indeed, very strongly coupled when, and these are the im- 
portant points, 

(a) They have very nearly equal velocities of propagation when un- 
coupled, and when 

(b) They are wound in opposite senses. 

It was found that virtually complete power transfer from outer to 
inner helix (or vice versa) could be effected over -a distance of the order 
of one helix wavelength (normally between 1/9 and Mo of a free-space 
wavelength. 

It was also found that it was possible to make a transition from a co- 
axial transmission line to a short (outer) helix and thence through the 
glass surrounding an inner helix, which was fairly good over quite a con- 
siderable bandwidth. Such a transition also acted as a directional coupler, 
RF power coming from the coaxial line being transferred to the inner 
helix predominantly in one direction. 

Thus, one of the shortcomings of the ‘‘conventional” helix traveling- 
wave tube, namely the necessary built-in accuracy of the matching 
parameters, was overcome by means of the new type of coupler that 
might evolve around coupled helix-to-helix systems. 

Other constructional and functional possibilities appeared as the 
work progressed, such as coupled-helix attenuators, various types of 
broadband couplers, and schemes for exciting pure transverse (slow) or 
longitudinal (fast) waves on coupled ‘helices. 

One central fact emerged from all these considerations: by placing 
part of the circuit outside the tube envelope with complete independence 
from the helix terminations inside the tube, coupled helices give back to 
the circuit designer a freedom comparable only with that obtained at 
much lower frequencies. For example, it now appears entirely possible 
to make one type of traveling wave tube to cover a variety of frequency 
bands, each band requiring merely different couplers or outside helices, 
the tube itself remaining unchanged. 

Moreover, one tube may now be made to fulfill a number of different 
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functions; this is made possible by the freedom with which couplers 
and attenuators can be placed at any chosen point along the tube. 

Considerable work in this field has been done elsewhere. Reference 
will be made to it wherever possible. However, only that work with 
which the authors have been intimately connected will be fully reported 
here. In particular, the effect of the electron beam on the wave propaga- 
tion phenomena will not be considered. 


2. THEORY OF COUPLED HELICES 
2.1 Introduction 


In the past, considerable success has been attained in the under- 
standing of traveling wave tube behavior by means of the so-called 
“transmission-line” approach to the theory. In particular, J. R. Pierce 
used it in his initial analysis and was thus able to present the solution 
of the so-called traveling-wave tube equations in the form of 4 waves, 
one of which is an exponentially growing forward traveling wave paele 
to the operation of the tube as an amplifier. 

This transmission-line approach considers the helix —or any slow- 
wave circuit for that matter — as a transmission line with distributed 
capacitance and inductance with which an electron beam interacts. 
As the first approximation, the beam is assumed to be moving in an RF 
field of uniform intensity across the beam. 

In this way very simple expressions for the coupling parameter and 
gain, etc., are obtained, which give one a good appreciation of the 
physically relevant quantities. 

A number of factors, such as the effect of space charge, the non-uniform 
distribution of the electric field, the variation of circuit impedance with 
frequency, etc., can, in principle, be calculated and their effects can be 
superimposed, so to speak, on the relatively simple expressions deriving 
from the simple transmission line theory. This has, in fact, been done and 
is, from the design engineer’s point of view, quite satisfactory. 

However, physicists are bound to be unhappy over this state of 
affairs. In the beginning was Maxwell, and therefore the proper point to 
start from is Maxwell. 

So-called “Field” theories of traveling-wave tubes, based on Maxwell’s 
equation, solved with the appropriate boundary conditions, have been 
worked out and their main importance is that they largely confirm the 
results obtained by the inexact transmission line theory. It is, however, 
in the nature of things that field theories cannot give answers in terms of 
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simple closed expressions of any generality. The best that can be done 
is in the form of curves, with step-wise increases of particular param- 
eters. These can be of considerable value in particular cases, and when 
exactness is essential. 

In this paper we shall proceed by giving the ‘‘transmission-line” type 
theory first, together with the elaborations that are necessary to arrive 
at an estimate of the strength of coupling possible with coaxial helices. 
The “‘field’”’ type theory will be used whenever the other theory fails, or 
is inadequate. Considerable physical insight can be gotten with the use 
of the transmission-line theory; nevertheless recourse to field theory is 
necessary in a number of cases, as will be seen. 

It will be noted that in all the calculations to be presented the presence 
of an electron beam is left out of account. This is done for two reasons: 
Its inclusion would enormously complicate the theory, and, as will 
eventually be shown, it would modify our conclusions only very slightly. 
Moreover, in practically all cases which we shall consider, the helices are 
so tightly coupled that the velocities of the two normal modes of propaga- 
tion are very different, as will be shown. Thus, only when the beam 
velocity is very near to either one or the other wave velocity, will 
growing-wave interaction take place between the beam and the helices. 
In this case conventional traveling wave tube theory may be used. 

A theory of coupled helices in the presence of an electron beam has 
been presented by Wade and Rynn,* who treated the case of weakly 
coupled helices and arrived at conclusions not at variance with our views. 


2.2 Transmission Line Equations 


Following Pierce we describe two lossless helices by their distributed 
series reactances Xip and Xz and their distributed shunt susceptances 
By and Bo . Thus their phase constants are 


P10 =V ByX10 
Bo = V BooX a9 


Let these helices be coupled by means of a mutual distributed reac- 
tance X,, and a mutual susceptance B,,, both of which are, in a way 
which will be described later, functions of the geometry. 

Let waves in the coupled system be described by the factor 

gitte Te 


where the I’s are the propagation constants to be found. 
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The transmission line equations may be written: 


Th, — GB, + IBV 2 = 0 


TV, — jXih + jXml2 = 0 (2.2.1) 
Tl. — jB2V2 + JBnVi = 0 ; 
[V2 — jXela + jXnli = 0 
where 
By = By + Bn 
X1 = Xp + Xm 
By = By + Bn 
X2 = Xo + Xm 
I, and J, are eliminated from the (2.2.1) and we find 
Vo _ +" + XiBi + XnBn) (2.29) 
Vi X1Bn + BoXm 
Vi _ +0 + X2B + XnBm) (2.2.3) 
Ve X2Bm + BiXm 


These two equations are then multiplied together and an expression for 
I of the 4th degree is obtained: 


TCG Br a RGB Ok Ror 








; : (2.2.4) 
+ (Xi X2 ma Ae )(B, Be ora Bi) = 
We now define a number of dimensionless quantities: 
2 
Bee ie b” = (capacitive coupling coefficient)’ 
BB. 
Xn = x = (inductive coupling coefficient)” 
X1Xe 
BX, = Br, BoX, = Bo 
X,B,X2.B. = 6° = (mean phase constant)* 
With these substitutions we obtain the general equation for I” 
P= a*| - ee ~ 
Be BY 
(2:2:5) 





ct i/i(E eae 3+ 2b) — (1-2) - v9 | 
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If we make the same substitutions in (2.2.2) we find 


Veo {72 E + Or + a s5%) 
Vy Zit B(6ib + Bor) — 


where the Z’s are the impedances of the helices, i.e., 
Zn = VXn/Ba 


2.3 Solution for Synchronous Helices 


Let us consider the particular case where 8: = B2 = 8. From (2.2.5) 
we obtain 


lr’ = —pL + xb + (a + d)] (2.3.1) 


Each of the above values of I” characterizes a normal mode of propaga- 
tion involving both helices. The two square roots of each I” represent 
waves going in the positive and negative directions. We shall consider 
only the positive roots of I’, denoted I, and I'¢, which represent the 
forward traveling waves. 


Tye = jBV/1 + xb + (x + D) (2.3.2) 
Ifz >Oandb>0 
Pee ell Dal ye 


Thus IT; represents a normal mode of propagation which is slower than 
the propagation velocity of either helix alone and can be called the 
“slow” wave. Similarly I’; represents a “fast”? wave. We shall find that, 
in fact, x and b are numerically equal in most cases of interest to us; we 
therefore write the expressions for the propagation constants 


PT, = jell + Yale + 6) 
(2.3.3) 
Te = jell — a(x + 6)] 


If we substitute (2.3.3) into (2.2.6) for the case where 6; = 6. = 8 and 
assume, for simplicity, that the helix self-impedances are equal, we find 
that for Tl = T; 


V2 
Lome | 
V1 

forT = [, 
Vige5 4 


V; 
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Thus, the slow wave is characterized by equal voltages of unlike sign on 
the two helices, and the fast wave by equal voltages of like sign. It fol- 
lows that the electric field in the annular region between two such coupled 
concentric helices will be transverse for the slow wave and longitudinal 
for the fast. For this reason the slow and fast modes are often referred 
to as the transverse and longitudinal modes, respectively, as indi- 
cated by our subscripts. 

It should be noted here that we arbitrarily chose b and x positive. A 
different choice of signs cannot alter the fact that the transverse mode is 
the slower and the longitudinal mode is the faster of the two. 

Apart from the interest in the separate existence of the fast and slow 
waves as such, another object of interest is the phenomenon of the simul- 
taneous existence of both waves and the interference, or spatial beating, 
between them. 

Let V2 denote the voltage on the outer helix; and let Vi, the voltage 
on the inner halix, be zero at z = 0. Then we have, omitting the common 
factor ¢7*", 

Vi = Vue" + Vae 
V2 = Vise" + Vine @ 


Sinceatz = 0, Vi = 0, Va = — Vz, . For the case we have considered we 
have found Va = —Veyand Va = Ve. We can write (2.3.4) as 


(2.3.4) 


Vi ay y ge = ge try 


(2.3.5) 
Vea Feet pet 
Ve can be written | | 
Vee aaa 4 Ure To 
= Ve PUTO? cog [—JG(T: — Taal 
In the case when x = b, and #1 = & = B 
V2 = Ve cos [}4(a + b) 6] (2.3.6) 
Correspondingly, it can be shown that the voltage on the inner helix is 
Vi = jVe™ sin [14 (a + b)6e] (2.3.7) 


The last two equations exhibit clearly what we have called the spatial 
beat phenomenon, a wave-like transfer of power from one helix to the 
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other and back. We started, arbitrarily, with all the voltage on the outer 
helix at zg = 0, and none on the inner; after a distance, 2’, which makes 
the argument of the cosine 7/2, there is no voltage on the outer helix 
and all is on the inner. 

To conform with published material let us define what we shall call 
the “coupling phase-constant”’ as 


Be = Bb + 2) (2.3.8) 
From (2.3.3) we find that for 6:1 = Bf. = 8, and z = b, 
| T, — Te = j6. 


2.4 Non-Synchronous Helix Solutions 


Let us now go back to the more general case where the propagation 
velocities of the (uncoupled) helices are not equal. Equation (2.2.5) can 
be written: 


P= =: (1 + (1/2)A + xb + 
V(1 + xbd)A + (1/4)A? + (b + 2)?] 


_ = Bf 
A ee ee 
PS 


In the case where x = b, (2.4.1) has an exact root. 


Tye = IVI + A/4 + 1/2 Va + @ + bY (2.4.2) 
We shall be interested in the difference between I; and Tz, 


(2.4.1) 


where 





Ty — Te = j8 VA + (x + bY (2.4.3) 
Now we substitute for A and find 
Ty— Te = j V6 — 62)? + 62 (b + xP (2.4.4) 


Let us define the “beat phase-constant”’ as: 
Bs = V(Bi — Bx) + Bb + x}? 
so that 
T;—- Te = j® (2.4.5) 
Further, let us define 
Ba = | B1 — Be | 
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and call this the “difference phase-constant,” i.e., the hase constant cor- 
responding to two uticoupled waves of the same frequency but differing 
phase velocities. We can thus state the relation between these phase 
constants: 


Bs = Ba + Be (2.4.6) 


This relation is identical (except for notation) with expression (33) in 
8. E. Miller’s paper.® In this paper Miller also gives expressions for the 
voltage amplitudes in two coupled transmission systems in the case of 
unequal phase velocities. It turns out that in such a case the power trans- 
fer from one system to the other is necessarily incomplete. This is of 
particular interest to us, in connection with a number of practical 
schemes. In our notation it is relatively simple, and we can state it by 
saying that the maximum fraction of power transferred is 


P= (2) (2.4.7) 
or, in more detail, 
pa—fe BE a 
Bee (8i— Be) +60 42) 
This relationship can be shown to be a good approximation from (2.2.6), 


(2.3.4), (2.4.2), on the assumption that b } «x and Z; } Z,, and the 
further assumption that the system is lossless; that is, 


; | Vo|* + | Vi| °=constant (2.4.8) 


We note that the phase velocity difference gives rise to two phenomena: 
It reduces the coupling wavelength and it reduces the amount of power 
that can be transferred from one helix to the other. 

Something should be said about the case where the two helix maled: 
ances are not equal, since this, indeed, is usually the case with coupled 
concentric helices. Equation (2.4.8) becomes: 


[Vel , |Vil’ 
Z| th 


Using this relation it is found from (2.3.4) that 


Vo [fa _ 4/3 —— 
V2 4/2 = peeve?) (2.4.10) 


When this is combined with (2.2.6) it is found that the impedances drop 
out with the voltages, and that “I” is a function of the 6’s only. In other 


= constant (2.4.9) 
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words, complete power transfer occurs when 6; = 62 regardless of the 
relative impedances of the helices. 

The reader will remember that 619 and Be , not 8; and 2 , were defined 
as the phase constants of the helices in the absence of each other. If the 
assumption that b ® x is maintained, it will be found that all of the de- 
rived relationships hold true when 8,0 is substituted for 6, . In other 
words, throughout the paper, 8; and 6: may be treated as the phase con- 
stants of the inner and outer helices, respectively. In particular it should 
be noted that if these quantities are to be measured experimentally each 
helix must be kept in the same environment as if the helices were coupled; 
only the other helix may be removed. That is, if there is dielectric in the 
annular region between the coupled helices, 8; and 82 must each be 
measured in the presence of that dielectric. 

Miller also has treated the case of lossy coupled transmission systems. 
The expressions are lengthy and complicated and we believe that no 
substantial error is made in simply applying his conclusions to our case. 

If the attenuation constants ai and a: of the two transmission systems 
(helices) are equal, no change is required in our expressions; when they 
~ are unequal the total available power (in both helices) is most effectively 
reduced when 


ay — a2 
——— 1 2.4.11 
aa ( ) 


This fact may be made use of in designing coupled helix attenuators. 


2.5 A Look at the Fields 


It may be advantageous to consider sketches of typical field distribu- 
tions in coupled helices, as in Fig. 2.1, before we go on to derive a quanti- 
tative estimate of the coupling factors actually obtainable in practice. 

Fig. 2.1(a) shows, diagrammatically, electric field lines when the 
coupled helices are excited in the fast or “longitudinal” mode. To set up 
this mode only, one has to supply voltages of like sign and equal ampli- 
tudes to both helices. For this reason, this mode is also sometimes called 
the ‘(-+-++) mode.” 

Fig. 2.1(b) shows the electric field lines when the helices are excited in 
the slow or “transverse” mode. This is the kind of field required in the 
transverse interaction type of traveling wave tube. In order to excite 
this mode it is necessary to supply voltages of equal amplitude and 
opposite signs to the helices and for this reason it is sometimes called the 
“(+ —) mode.” One way of exciting this mode consists in connecting one 
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helix to one of the two conductors of a balanced transmission line 
(‘‘Lecher’’-line) and the other helix to the other. 

Fig. 2.1(c) shows the electric field configuration when fast and slow 
modes are both present and equally strongly excited. We can imagine 
the two helices being excited by a voltage source connected to the outer 
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Fig. 2.1 — Typical electric ficld distributions in coupled coaxial helices when 
they are excited in: (a) the in-phase or longitudinal mode, (b) the out-of-phase or 
transverse mode, and (c) both modes equally. 
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helix only at the far left side of the sketch. One, perfectly legitimate, 
view of the situation is that the RF power, initially all on the outer helix, 
leaks into the inner helix because of the coupling between them, and then 
leaks back to the outer helix, and so forth. 

Apart from noting the appearance of the stationary spatial beat (or 
interference) phenomenon these additional facts are of interest: 

1) It is a simple matter to excite such a beat-wave, for instance, by 
connecting a lead to either one or the other of the helices, and 

2) It should be possible to discontinue either one of the helices, at 
points where there is no current (voltage) on it, without causing reflec- 
tions. 


2.6 A Simple Estimate of b and x 


How strong a coupling can one expect from concentric helices in prac- 
tice? Quantitatively, this is expressed by the values of the coupling fac- 
tors x and b, which we shall now proceed to estimate. 

A first crude estimate is based on the fact that slow-wave fields are 
known to fall off in intensity somewhat as e*” where @ is the phase con- 
stant of the wave and r the distance from the surface guiding the slow 
wave. Thus a unit charge placed, say, on the inner helix, will induce a 
charge of opposite sign and of magnitude 

g boa) 


on the outer helix. Here b = mean radius of the outer helix and a = 
mean radius of the inner. We note that the shunt mutual admittance 
coupling factor is negative, irrespective of the directions in which the 
helices are wound. Because of the similarity of the magnetic and electric 
field distributions a current flowing on the inner helix will induce a simi- 
larly attenuated current, of amplitude 


ef (b—a) 


on the outer helix. The direction of the induced current will depend on 
whether the helices are wound in the same sense or not, and it turns out 
(as one can verify by reference to the low-frequency case of coaxial 
coupled coils) that the series mutual impedance coupling factor is nega- 
tive when the helices are oppositely wound. 

In order to obtain the greatest possible coupling between concentric 
helices, both coupling factors should have the same sign. This then re- 
quires that the helices should be wound in opposite directions, as has 
been pointed out by Pierce. 

When the distance between the two helices goes to zero, that is to say, 


142 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 


if they lie in the same surface, it is clear that both coupling factors b and x 
will go to unity. 

As pointed out earlier in Section 2.3, the choice of sign for b is arbi- 
trary. However, once a sign for b has been chosen, the sign of x is neces- 
sarily the opposite when the helices are wound in the same direction, and 


vice versa. We shall choose, therefore, 
b = +e be 
na Fe hte (2.6.1) 


the sign of the latter depending on whether the helices are wound in the 
same direction or not. 
In the case of unequal velocities, 8, the propagation constant, would 


_ be given by | 
B = VBiBo (2.6.2) 


2.7 Strength of Coupling versus Frequency 


The exponential variation of coupling factors with respect to frequency 
(since 8 = w/v) has an important consequence. Consider the expression 
for the coupling phase constant 








Be = B(b + 2) (2.3.8) 
or 
| | 6.| = 26¢°° (2.7.1) 
The coupling wavelength, which is defined as 
20 
he = | — (2.7.2) 
Be 
is, therefore, 
T B(b—a) 
Ne = 5 e 
B 
or . 
ey a (2.7.3) 


“where is the (slowed-down) RF wavelength on either helix. It is con- 
venient to multiply both sides of (2.7.1) with a, the inner helix radius, 
in order to obtain a dimensionless relation between 6, and Bp: 


Bea = 2Bac PCO) (2.7.4) 


This relation is plotted on Fig. 2.2 for several values of b/a. 
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Fig. 2.2 — Coupling phase-constant plotted as a function of the single helix 
phase-constant for synchronous helices for several values of b/a. These curves 
are based on simple estimates made in Section 2.7. 


There are two opposing tendencies determining the actual physical 
length of a coupling beat-wavelength: 

1) It tends to grow with the RF wavelength, being proportional to it 
in the first instance; . 

2) Because of the tighter coupling possible as the RF wavelength 
increases in relation to the helix-to-helix distance, the coupling beat- 
wavelength tends to shrink. 

Therefore, there is a region where these tendencies cancel each other, 
and where one would expect to find little change of the coupling beat- 
wavelength for a considerable change of RF frequency. In other words, 
the “bandwidth” over which the beat-wavelength stays nearly constant 
can be large. 

This is a situation naturally very desirable and favorable for any 
device in which we rely on power transfer from one helix to the other by 
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means of a length of overlap between them an integral number of half 
beat-wavelengths long. Obviously, one will design the helices in such a 
way as to take advantage of this situation. 

Optimum conditions are easily obtained by differentiating 8. with 
respect to 6 and setting 08./08 equal to zero..This gives for the optimum 
conditions 


1 
Bopt = —— 


—- @ 


(2.7.5) 


or 


26 = | 
Beout = b _ i = 2e se (2.7.6) 





Equation (2.7.5), then, determines the ratio of the helix radii if it is re- 
quired that deviations from a chosen operating frequency shall have 
least effect. 


2.8 Field Solutions 


In treating the problem of coaxial coupled helices from the transmis- 
sion line point of view one important fact has not been considered, 
namely, the dispersive character of the phase constants of the separate 
helices, 6: and 82. By dispersion we mean change of phase velocity with 
frequency. If the dispersion of the inner and outer helices were the same 
it would be of little consequence. It is well known, however, that the 
dispersion of a helical transmission line is a function of the ratio of helix 
radius to wavelength, and thus becomes a parameter to be considered. 
When the theory of wave propagation on a helix was solved by means of 
Maxwell’s equations subject to the boundary condition of a helically 
conducting cylindrical sheath, the phenomenon of dispersion first made 
its appearance. It is clear, therefore, that a more complete theory of 
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Fig. 2.3 — Sheath helix arrangement on which the field equations are based. 
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coupled helices will require similar treatment, namely, Maxwell’s equa- 
tions solved now with the boundary conditions of two cylindrical heli- 
cally conducting sheaths. As shown on Fig. 2.3, the inner helix is specified 
by its radius a and the angle ¥; made by the direction of conductivity 
with a plane perpendicular to the axis; and the outer helix by its radius 
b (not to be confused with the mutual coupling coefficient b) and its 
corresponding pitch angle y . We note here that oppositely wound helices 
require opposite signs for the angles y1 and y2 ; and, further, that helices 
with equal phase velocities will have pitch angles of about the same 
absolute magnitude. 

The method of solving Maxwell’s equations subject to the above men- 
tioned boundary conditions is given in Appendix I. We restrict our- 
selves here to giving some of the results in graphical form. 

The most universally used parameter in traveling-wave tube design is 
a combination of parameters: 


Boa cot Wy 


where 89 = 27/0, Xo being the free-space wavelength, a the radius of 
the inner helix, and y the pitch angle of the inner helix. The inner helix 
is chosen here in preference to the outer helix because, in practice, it will 
be part of a traveling-wave tube, that is to say, inside the tube envelope. 
Thus, it is not only less accessible and changeable, but determines the 
important aspects of a traveling-wave tube, such as gain, power output, 
and efficiency. 

The theory gives solutions in terms of radial propagation constants 
which we shall denote y; and yz (by analogy with the transverse and 
longitudinal modes of the transmission line theory). These propagation 
constants are related to the axial propagation constants 6; and Bz by 


Yn = VB => Bo 
Of course, in transmission line theory there is no such thing as a radial 
propagation constant. The propagation constant derived there and de- 


noted I’ corresponds here to the axial propagation constant 78. By 
analogy with (2.4.5) the beat phase constant should be written 


Bo = Bi — Be 


However, in practice 8 is usually much smaller than 6 and we can there- 
fore write with little error 


Po = Yt — Vt 


for the beat phase constant. For practical purposes it is convenient to 
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Fig. 2.4.1 — Beat phase-constant plotted as a function of Boa cot Yi. These 
ee result from the solution of the field equations given in the appendix. For 
b/a = 1.25. 


normalize in terms of the inner helix radius, a: 


Ba = ya — ya 


This has been plotted as a function of 8) a cot % in Fig. 2.4, which 
should be compared with Fig. 2.2. It will be seen that there is considerable 
agreement between the results of the two methods. 


2.9 Bifilar Helix 


The failure of the transmission line theory to take into account dis- 
persion is well illustrated in the case of the bifilar helix. Here we have 
two identical helices wound in the same sense, and at the same radius. 
If the two wires are fed in phase we have the normal mode characterized 
by the sheath helix model whose propagation constant is the familiar 
Curve A of Fig. 2.5. If the two wires of the helix are fed out of phase we 
have the bifilar mode; and, since that is a two wire transmission system, 
we shall have a TEM mode which, in the absence of dielectric, propa- 
gates along the wire with the velocity of light. Hence, the propagation 
constant for this mode is simply @oa cot w and gives rise to the horizontal 
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Fig. 2.4.2 — Beat phase-constant plotted as a function of Boa cot y, . These 
rib result from the solution of the field equations given in the appendix. For 
a= 1.5. 


line of Curve B in Fig. 2.5. Again the coupling phase constant 8, is given 
by the difference of the individual phase constants: 


Ba = Boa cot wy — ya (2.9.1) 


which is plotted in Fig. 2.6. Now note that when By) < y this equation is 
accurate, for it represents a solution of the field equations for the helix. 

From the simple unsophisticated transmission line point of view no 
coupling between the two helices would, of course, have been expected, 
since the two helices are identical in every way and their mutual capacity 
and inductance should then be equal and opposite. 

Experiments confirm the essential correctness of (2.9.1). In one experi- 
ment, which was performed to measure the coupling wavelength for the 
bifilar helices, we used helices with a cot y = 3.49 and a radius of 0.036 
em which gave a value, at 3,000 me, of Ba cot y = 0.51. In these experi- 
ments the coupling length, L, defined by 


(Boa cot wy — ya) 2 = 


was measured to be 15.7a as compared to a value of 13.5a from Fig. 2.6. 
At 4,000 mc the measured coupling length was 14,6a as compared to 


148 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 


= 


\ 





aes 
NE 





aed eine 
E 











Se eich 
Bi ale 
ay, 
race 





Mt A-M apa 


2 se 
Raksee 
iz RRS 


Ee 
Els) 
Eley 
=a 
Ke 
Pes 








| 
jz 
< 


fo ACOT 


Fig. 2.4.3 — Beat phase-constant plotted as a function of Boa cot ¥, . These 
curves ue from the solution of the field equations given in the appendix. For 
b/a = 1.75 


12.6a computed from Fig. 2.6, thus confirming the theoretical prediction 
rather well. The slight increase in coupling length is attributable to the 
dielectric loading of the helices which were supported in quartz tubing. 
The dielectric tends to decrease the dispersion and hence reduce 8, . This 
is discussed further in the next section. 


2.10 Lffect of Dielectric Material between Helices 


In many cases which are of interest in practice there is dielectric ma- 
terial between the helices. In particular when coupled helices are used 
with traveling-wave tubes, the tube envelope, which may be of glass, 
quartz, or ceramic, all but fills the space between the two helices. 

It is therefore of interest to know whether such dielectric makes any 
difference to the estimates at which we arrived earlier. We should not be 
surprised to find the coupling strengthened by the presence of the di- 
electric, because it is known that dielectrics tend to rob RF fields from 
the surrounding space, leading to an increase in the energy flow through 
the dielectric. On the other hand, the dielectric tends to bind the fields 
closer to the conducting medium. To find a qualitative answer to this 
question we have calculated the relative coupling phase constants for 
two sheath helices of infinite radius separated by a distance “d” for 1) 
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Fig. 2.4.4 — Beat phase-constant plotted as a function of Boa cot ¥,. These 
curves result from the solution of the field equations given in the appendix. For 
a= 2.0. 


the case with dielectric between them having a relative dielectric con- 
stant ¢’ = 4, and 2) the case of no dielectric. The pitch angles of the two 
helices were y and —y, respectively; i.e., the helices were assumed to be 
synchronous, and wound in the opposite sense. 

Fig. 2.7 shows a plot of the ratio of B.e/8. to B./8 versus By (d/2) cot y, 
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Fig. 2.4.5 — Beat phase-constant plotted as a function of Boa cot y, . These 
curves result from the solution of the field equations given in the appendix. For 
ii 0. 
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Fig. 2, 5 — Propagation constants for a bifilar helix plotted as a function of 
Boa cot yi. The curves illustrate, (A) the dispersive character of the in-phase 
mode and, (B) the non-dispersive "character of the out-of- phase mode. 


where 8.. is the coupling phase-constant in the presence of dielectric, 
6. is the phase-constant of each helix alone in the presence of the same 
dielectric, 8, is the coupling phase-constant with no dielectric, and £ is 
the phase constant of each helix in free space. In many cases of interest 
Bo(d/2) cot y is greater than 1.2. Then 


aU . E : | en (26/422) Bo (d/2) cot ¥ (2.10.1) 


Appearing in the same figure is a similar plot for the case when there is a 
conducting shield inside the inner helix and outside the outer, and 
separated a distance, ‘‘s,” from the helices. Note that 


d=b-a. 


It appears from these calculations that the effect of the presence of 
dielectric between the helices depends largely on the parameter By (d/2) 
cot y. For values of this parameter larger than 0.3 the coupling wave- 
length tends to increase in terms of circuit wavelength. For values smaller 
than 0.3 the opposite tends to happen. Note that the curve representing 
(2.10.1) is a fair approximation down to fo(d/2) cot y = 0.6 to the curve 
representing the exact solution of the field equations. J. W. Sullivan, in 
unpublished work, has drawn similar conclusions. 
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2.11 The Conditions for Maximum Power Transfer 


The transmission line theory has led us to expect that the most efficient 
power transfer will take place if the phase velocities on the two helices, 
prior to coupling, are the same. Again, this would be true were it not for 
the dispersion of the helices. To evaluate this effect we have used the 
field equation to determine the parameter of the coupled helices which 
gives maximum power transfer. To do this we searched for combinations 
of parameters which give an equal current flow in the helix sheath for 
either the longitudinal mode or the transverse mode. This was suggested 
by L. Stark, who reasoned that if the currents were equal for the indi- 
vidual modes the beat phenomenon would give points of zero RF current 
on the helix. 

The values of cot ~2/cot 1 which are required to produce this condi- 
tion are plotted in Fig. 2.8 for various values of b/a. Also there are shown 
values of cot ¥2/cot ¥; required to give equal axial velocities for the helices 
before they are coupled. It can be seen that the uncoupled velocity of the 
inner helix must be slightly slower than that of the outer. 

A word of caution is necessary for these curves have been plotted 
without considering the-effects of dielectric loading, and this can have a 
rather marked effect on the parameters which we have been discussing. 
The significant point brought out by this calculation is that the optimum 
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Fig. 2.6 — The coupling phase-constant which results from the two possible 
modes of propagation on a bifilar helix shown as a function of Bya cot 1 . 
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Fig. 2.7 — The effect of dielectric material between coupled infinite radius 
sheath helices on their relative coupling phase-constant shown as a function of 
Bod/2 cot y . The effect of shielding on this relation is also indicated. 


condition for coupling is not necessarily associated with equal velocities 
on the uncoupled helices. 


2.12 Mode Impedance 


Before leaving the general theory of coupled helices something should 
be said regarding the impedance their modes present to an electron beam 
traveling either along their axis or through the annular space between 
them. The field solutions for cross wound, coaxially coupled helices, 
which are given in Appendix IJ, have been used to compute the imped- 
ances of the transverse and longitudinal modes. The impedance, K, is 
defined, as usual, in terms of the longitudinal field on the axis and the 
power flow along the system. 
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In Fig. 2.9, (ya), for various ratios of inner to outer radius, is plotted 
for both the transverse and longitudinal modes together with the value 
of F(ya) for the single helix (b/a = ©). We sce that the longitudinal 
mode has a higher impedance with cross wound coupled helices than 
does a single helix. We call attention here to the fact that this is the 
same phenomenon which is encountered in the contrawound helix’, where 
the structure consists of two oppositely wound helices of the same radius. 

As defined here, the transverse mode has a lower impedance than the 
single helix. This, however, is not the most significant comparison; for 
it is the transverse field midway between helices which is of interest in 
the transverse mode. The factor relating the impedance in terms of the 
transverse field between helices to the longitudinal field on the axis is 
E,(*)/E7(0), where f is the radius at which the longitudinal component 
of the electric field Z,, is zero for the transverse mode. This factor, 
plotted in Fig. 2.10 as a function of Goa cot ¥, , shows that the impedance 
in terms of the transverse field at 7 is interestingly high. 
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Fig. 2.8 — The values of cot ¥2/cot y1 required for complete power transfer 
plotted as a function of Boa cot ¥; for several values of b/a. For comparison, the 
value of cot ¥2/cot ¥1 required for equal velocities on inner and outer helices is also 
shown. 
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Fig. 2.9 — Impedance parameter, F' (ya), associated with both transverse and 
longitudinal modes shown for several values of b/a. Also shown is F'(ya) for a 
single helix. 


It is also of interest to consider the impedance of the longitudinal 
mode in terms of the longitudinal field between the two helices. The 
factor, E7(7)/E;(0), relating this to the axial impedance is plotted in 
Fig. 2.11. We see that rather high impedances can also be obtained with 
the longitudinal field midway between helices. This, in conjunction with 
a hollow electron beam, should provide efficient amplification.’ 


3. APPLICATION OF COUPLED HELICES 


When we come to describe devices which make usc of coupled helices 
we find that they fall, quite naturally, into two separate classes. One 
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class contains those devices which depend on the presence of only one of 
the two normal modes of propagation. The other class of devices depends 
on the simultaneous presence, in roughly equal amounts, of both normal 
modes of propagation, and is, in general, characterized by the w ords 
“spatial beating.” Since spatial beating implies energy surging to and 
fro between inner and outer helix, there is no special problem in exciting 
both modes simultaneously. Power fed exclusively to one or the other 
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Fig. 2.10 — The relation between the impedance in terms of the transverse 
field between coupled helices excited in the out-of-phase mode, and the impedance 
in terms of the longitudinal field on the axis shown as a function of Boa cot Wi. 
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helix will inevitably excite both modes equally. When it is desired to 
excite one mode exclusively a more difficult problem has to be solved. 
Therefore, in section 3.1 we shall first discuss methods of exciting one 
mode only before going on to discuss in sections 3.2 and 3.3 devices 
using one mode only. 

In section 3.4 we shall discuss devices depending on the simultaneous 
presence of both modes. 


3.1 Excitation of Pure Modes 


3.1.1 Direct Hacitation 


In order to set up one or the other normal mode on coupled helices, 
voltages with specific phase and amplitudes (or corresponding currents) 
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Fig. 2.11 — The relation between the impedance in terms of the longitudinal 
field between coupled helices excited in the in-phase mode, and the impedance in 
terms of the longitudinal field on the axis shown as a function of Boa cot 1. 
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have to be supplied to each helix at the input end. A natural way of doing 
this might be by means of a two-conductor balanced transmission line 
(Lecher-line), one conductor being connected to the inner helix, the other 
to the outer helix. Such an arrangement would cause something like the 
transverse (+—) mode to be set up on the helices. If the two con- 
ductors and the balanced line can be shielded from each other starting 
some distance from the helices then it is, in principle, possible to intro- 
duce arbitrary amounts of extra delay into one of the conductors. A delay 
of one half period would then cause the longitudinal (+--+) mode to be 
set up in the helices. Clearly such a coupling scheme would not be 
broad-band since a frequency-independent delay of one half period is not 
realizable. 

Other objections to both of these schemes are: Balanced lines are not 
generally used at microwave frequencies; it is difficult to bring leads 
through the envelope of a TWT without causing reflection of RF energy 
and without unduly encumbering the mechanical design of the tube plus 
circuits; both schemes are necessarily inexact because helices having 
different radii will, in general, require different voltages at either input 
in order to be excited in a pure mode. 

Thus the practicability, and success, of any general scheme based on 
the existence of a pure transverse or a pure longitudinal mode on coupled 
helices will depend to a large extent on whether elegant coupling means 
are available. Such means are indeed in existence as will be shown in the 
next sections. 


3.1.2 Tapered Coupler 


A less direct but more elegant means of coupling an external circuit 
to either normal mode of a double helix arrangement is by the use of the 
so-called “tapered”’ coupler.*' *: ” By appropriately tapering the relative 
propagation velocities of the inner and outer helices, outside the inter- 
action region, one can excite either normal mode by coupling to one 
helix only. 

The principle of this coupler is based on the fact that any two coupled 
transmission lines support two, and only two, normal modes, regardless 
of their relative phase velocities. These normal modes are characterized 
by unequal wave amplitudes on the two lines if the phase velocities are 
not equal. Indeed the greater the phase velocity difference and/or 
the smaller the coupling coefficient between the lines, the more their 
wave amplitudes diverge. Furthermore, the wave amplitude on the line 
with the slower phase velocity is greater for the out-of-phase or trans- 
verse normal mode, and the wave amplitude on the faster line is greater 
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for the longitudinal normal mode. As the ratio of phase constant to 
coupling constant approaches infinity, the ratio of the wave amplitudes 
on the two lines does also. Finally, if the phase velocities of, or coupling 
between, two coupled helices are changed gradually along their length 
the normal modes existing on the pair roughly maintain their identity 
even though they change their character. Thus, by properly tapering the 
phase velocities and coupling strength of any two coupled helices one 
can cause the two normal modes to become two separate waves, one 
existing on each helix. 

For instance, if one desires to extract a signal propagating in the in- 
phase, or longitudinal, normal mode from two concentric helices of equal 
phase velocity, one might gradually increase the pitch of the outer helix 
and decrease that of the inner, and at the same time increase the diameter 
of the outer helix to decrease the coupling, until the longitudinal mode 
exists as a wave on the outer helix only. At such a point the outer helix 
may be connected to a coaxial line and the signal brought out. 

This kind of coupler has the advantage of being frequency insensitive; 
and, perhaps, operable over bandwidths upwards of two octaves. It 
has the disadvantage of being electrically, and sometimes physically, 
quite long. 


3.1.3 Stepped Coupler 


There is yet a third way to excite only one normal mode on a double 
helix. This scheme consists of a short length at each end of the outer helix, 
for instance, which has a pitch slightly different from the rest. This 
has been called a “stepped” coupler. 

The principle of the stepped coupler is this: If two coupled transmis- 
sion lines have unlike phase velocities then a wave initiated in one line 
can never be completely transferred to the other, as has been shown in 
Section 2.4. The greater the velocity difference the less will be the maxi- 
mum transfer. One can choose a velocity difference such that the maxi- 
mum power transfer is just one half the initial power. It is a characteristic 
of incomplete power transfer that at the point where the maximum trans- 
fer occurs the waves on the two lines are exactly either in-phase or out-of- 
phase, depending on which helix was initially excited. Thus, the condi- 
tions for a normal mode on two equal-velocity helices can be produced 
at the maximum transfer point of two unlike velocity helices by initiating 
a wave on only one of them. If at that point the helix pitches are changed 
to give equal phase velocities in both helices, with equal current or volt- 
age amplitude on both helices, either one or the other of the two normal 
modes will be propagated on the two helices from there on. Although the 
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pitch and length of such a stepped coupler are rather critical, the re- 
quirements are indicated in the equations in Section 2.4. 

The useful bandwidth of the stepped coupler is not as great as that 
of the tapered variety, but may be as much as an octave. It has however 
the advantage of being very much shorter and simpler than the tapered 
coupler. 


3.2 Low-Noise Transverse-Field Amplifier 


One application of coupled helices which has been suggested from the 
very beginning is for a transverse field amplifier with low noise factor. 
In such an amplifier the RF structure is required to produce a field which 
is purely transverse at the position of the beam. For the transverse mode 
there is always such a cylindrical surface where the longitudinal field is 
zero and this can be obtained from the field equation of Appendix II. 
In Fig. 3.1 we have plotted the value of the radius 7 at which the longi- 
tudinal field is zero for various parameters. The significant feature of 
this plot is that the radius which specifies zero longitudinal field is not 
constant with frequency. At frequencies away from the design frequency 
the electron beam will be in a position where interaction with longitudinal 
components might become important and thus shotnoise power will be 
introduced into the circuit. Thus the bandwidth of the amplifier over 
which it has a good noise factor would tend to be limited. However, this 
effect can be reduced by using the smallest practicable value of b/a. 

Section 2.12 indicates that the impedance of the transverse mode is 
very high, and thus this structure should be well suited for transverse 
field amplifiers. 


3.3 Dispersive Traveling-Wave Tube 


Largé bandwidth is not always essential in microwave amplifiers. In 
particular, the enormous bandwidth over which the traveling-wave tube 
is potentially capable of amplifying has so far found little application, 
while relatively narrow bandwidths (although quite wide by previous 
standards) are of immediate interest. Such a relatively narrow band, if 
it is an inherent electronic property of the tube, makes matching the 
tube to the external circuits easier. It may permit, for instance, the use 
of non-reciprocal attenuation by means of ferrites in the ferromagnetic 
resonance region. It obviates filters designed to deliberately reduce the 
band in certain applications. Last, but not least, it offers the possibility 
of trading bandwidth for gain and efficiency. 

A very simple method of making a traveling-wave tube narrow-band 
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g. 3.1 — The radius 7 at which the longitudinal field is zero for transversely 
deeited coupled coaxial helices. 


is by using a dispersive circuit, (i.e. one in which the phase velocity varies 
significantly with frequency). Thus, we obtain an amplifier that can be 
tuned by varying the beam voltage; being dispersive we should also 
expect a low group velocity and therefore higher circuit impedance. 

Calculations of the phase velocities of the normal modes of coupled 
concentric helices presented in the appendix show that the fast, longitu- 
dinal or (++) mode is highly dispersive. Given the geometry of two 
such coupled helices and the relevant data on an electron beam, namely 
current, voltage and beam radius, it is possible to arrive at an estimate 
of the dependence of gain on frequency. 

Experiments with such a tube showed a bandwidth 3.8 times larger 
than the simple estimate would show. This we ascribe to the presence 
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of the dielectric between the helices in the actual tube, and to the neglect 
of power propagated in the form of spatial harmonics. 

Nevertheless, the tube operated satisfactorily with distributed non- 
reciprocal ferrite attenuation along the whole helix and gave, at the 
center frequency of 4,500 mc/s more than 40 db stable gain. 

The gain fell to zero at 3,950 mc/s at one end of the band and at 
4,980 me/s at the other. The forward loss was 12 db. The backward 
loss was of the order of 50 db at the maximum gain frequency. 


3.4 Devices Using Both Modes ‘ 


In this section we shall discuss applications of the coupled-helix princi- 
ple which depend for their function on the simultaneous presence of both 
the transverse and the longitudinal modes. When present in substantially 
equal magnitude a spatial beat-phenomenon takes place, that is, RF 
power transfers back and forth between inner and outer helix. 

Thus, there are points, periodic with distance along each helix, where 
there is substantially no current or voltage; at these points a helix can be 
terminated, cut-off, or connected to external circuits without detriment. 

The main object, then, of all devices discussed in this section is power 
transfer from one helix to the other; and, as will be seen, this can be ac- 
complished in a remarkably efficient, elegant, and broad-band manner. 


3.4.1 Coupled-Helix Transducer 


It is, by now, a well known fact that a good match can be obtained 
between a coaxial line and a helix of proportions such as used in TWT’s. A 
wire helix in free space has an effective impedance of the order of 100 
ohms. A conducting shield near the helix, however, tends to reduce the 
helix impedance, and a value of 70 or even 50 ohms is easily attained. 
Provided that the transition region between the coaxial line and the 
helix does not present too abrupt a change in geometry or impedance, 
relatively good transitions, operable over bandwidths of several octaves, 
can be made, and are used in practice to feed into and out of tubes em- 
ploying helices such as TWT’s and backward-wave oscillators. 

One particularly awkward point remains, namely, the necessity to lead 
the coaxial line through the tube envelope. This is a complication in 
manufacture and requires careful positioning and dimensioning of the 
helix and other tube parts. ' 

Coupled helices offer an opportunity to overcome this difficulty i in the 
form of the so-called coupled-helix transducer, a sketch of which is 
shown in Fig. 3.2. As has been shown in Section 2.3, with helices having 
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the same velocity an overlap of one half of a beat wavelength will result 
in a 100 per cent power transfer from one helix to the other. A signal in- 
troduced into the outer helix at point A by means of the coaxial line will 
be all on the inner helix at point B, nothing remaining on the outer helix. 
At that point the outer helix can be discontinued, or cut off; since there 
is no power there, the seemingly violent discontinuity represented by the 
‘open” end of the helix will cause no reflection of power. In practice, un- 
fortunately, there are always imperfections to consider, and there will 
often be some power left at the end of the coupler helix. Thus, it is de- 
sirable to terminate the puter helix at this point non-reflectively, as, for 
instance, by a resistive element of the right value, or by connecting to it 
another matched coaxial line which in turn is then non-reflectively ter- 
minated. 

It will be seen, therefore, that the coupled-helix transducer can, in 
principle, be made into an efficient device for coupling RF energy from 
a coaxial line to a helix contained in a dielectric envelope such as a glass 
tube. The inner helix will be energized predominantly in one direction, 
namely, the one away from the input connection. Conversely, energy 
traveling initially in the inner helix will be transferred to the outer, and 
made available as output in the respective coaxial line. Such a coupled- 
helix transducer can be moved along the tube, if required. As long as the 
outer helix completely overlaps the inner, operation as described above 
should be assured. By this means a new flexibility in design, operation 
and adjustment of traveling-wave tubes is obtained which could not be 
achieved by any other known form of traveling-wave tube transducer. 

Naturally, the applications of the coupled-helix transducer are not 
restricted to TWT’s only, nor to 100 per cent power transfer. To obtain 





Fig. 3.2 — A simple coupled helix transducer. 
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power transfer of proportions other than 100 per cent two possibilities 
are open: either one can reduce the length of the synchronous coupling 
helix appropriately, or one can deliberately make the helices non-syn- 
chronous. In the latter case, a considerable measure of broad-banding 
can be obtained by making the length of overlap again equal to one half 
of a beat-wavelength, while the fraction of power transferred is deter- 
mined by the difference of the helix velocities according to 2.4.7. An 
application of the principle of the coupled-helix transducer to a variable 
delay line has been described by L. Stark” in an unpublished memo- 
randum. 

Turning again to the complete power transfer case, we may ask: 
How broad is such a coupler? 

In Section 2.7 we have discussed how the radial falling-off of the RF 
energy near a helix can be used to broad-band coupled-helix devices 
which depend on relative constancy of beat-wavelength as frequency 
is varied. On the assumption that there exists a perfect broad-band match 
between a coaxial line and a helix, one can calculate the performance of 
a coupled-helix transducer of the type shown in Fig. 3.2. 

Let us define a center frequency w, at which the outer helix is exactly 
one half beat-wavelength, d,, long. If w is the frequency of minimum 
beat wavelength then at frequencies w, and w., larger and smaller, 
respectively, than w, the outer helix will be a fraction 6 shorter than 
16m, , (Section 2.7). Let a voltage amplitude, V2, exist at the point where 
the outer helix is joined to the coaxial line. Then the magnitude of the 
voltage at the other end of the outer helix will be | V2-sin (76/2) | which 
means that the power has not been completely transferred to the inner 
helix. Let us assume complete reflection at this end of the outer helix. 
Then all but a fraction of the reflected power will be transferred to the 
inner helix in a reverse direction. Thus, we have a first estimate for the 
“directivity” defined as the ratio of forward to backward power (in db) 
introduced into the inner helix: 


D =] 10 log sin’ (2) | (3.4.1.1) 


2 





We have assumed a perfect match between coaxial line and outer helix; 
thus the power reflected back into the coaxial line is proportional to 
sin‘(76/2). Thus the reflectivity defined as the ratio of reflected to 
incident power is given in db by 


R = 10 log sin* (") (3.4.1.2) 
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For the sake of definiteness, let us choose actual figures: let Ba = 2.0. 
and b/a = 1.5. And let us, arbitrarily, demand that P always be less than 
—20 db. 

This gives sin (76/2) < 0.3816 and 76/2 < 18.42° or 0.294 radians, 
5 < 0.205. With the optimum value of B.a = 1.47, this gives the mini- 
mum permissible value of 8.a of 1.47/(1 + 0.205) = 1.22. From the 
graph on Fig. 2.2 this corresponds to values of Ba of 1.00 and 3.50. 
Therefore, the reflected power is down 20 db over a frequency range of 
we/a, = 3.5 to one. Over the same range, the directivity is better than 
10 to one. Suppose a directivity of better than 20 db were required. 
This requires sin (76/2) = 0.10, 6 = 0.0638 and is obtained over a fre- 
quency range of approximately two to one. Over the same range, the 
reflected power would be down by 40 db. 

In the above example the full bandwidth possibilities have not been 
used since the coupler has been assumed to have optimum length when 
8.4 is maximum. If the coupler is made longer so that when ,a is maxi- 
mum it is electrically short of optimum to the extent permissible by 
the quality requirements, then the minimum allowable 6,a becomes even 
smaller. Thus, for b/a = 1.5 and directivity 20 db or greater the rea- 
lizable bandwidth is nearly three to one. 

When the coupling helix is non-reflectively terminated at both ends, 
either by means of two coaxial lines or a coaxial line at one end and a 
resistive element at the other, the directivity is, ideally, infinite, irrespec- 
tive of frequency; and, similarly, there will be no reflections. The power 
transfer to the inner helix is simply proportional to cos’ (16/2). Thus, 
under the conditions chosen for the example given above, the coupled- 
helix transducer can approach the ideal transducer over a considerable 
range of frequencies. 

So far, we have inspected the performance and bandwith of the 
coupled-helix transducer from the most optimistic theoretical point of 
view. Although a more realistic approach does not change the essence 
of our conclusions, it does modify them. For instance, we have neglected 
dispersion on the helices. Dispersion tends to reduce the maximum at- 
tainable bandwidth as can be seen if Fig. 2.4.2 rather than Fig. 2.2 is 
used in the example cited above. The dielectric that exists in the annular 
region between coupled concentric helices in most practical couplers 
may also affect the bandwidth. 

In practice, the performance of coupled-helix transducers has been 
short of the ideal. In the first place, the match from a coaxial line to a 
helix is not perfect. Secondly, a not inappreciable fraction of the RF 
power on a real wire helix is propagated in the form of spatial harmonic 
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Fig. 3.8 — The return loss and directivity of an experimental 100 per cent 
coupled-helix transducer. 


wave components which have variations with angle around the helix- 
axis, and coupling between such components on two helices wound in 
opposite directions must be small. Finally, there are the inevitable me- 
chanical inaccuracies and misalignments. 

Fig. 3.3 shows the results of measurements on a coupled-helix trans- 
ducer with no termination at the far end. 


3.4.2 Coupled-Helix Attenuator 


In most TWT’s the need arises for a region of heavy attenuation 
somewhere between input and output; this serves to isolate input and 
output, and prevents oscillations due to feedback along the circuit. Be- 
cause of the large bandwidth over which most TWT’s are inherently 
capable of amplifying, substantial attenuation, say at least 60 db, is 
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required over a bandwidth of maybe 2 octaves, or even more. Further- 
more, such attenuation should present a very good match to a wave on 
the helix, particularly to a wave traveling backwards from the output 
of the tube since such a wave will be eapanes by the output section of 
the tube. 

Another requirement is that the attenuator should be physically as 
short as possible so as not to increase the length of the tube unneces- 
sarily. 

Finally, such attenuation might, with advantage, be made movable 
during the operation of the tube in order to obtain optimum performance, 
perhaps in respect of power output, or linearity, or some other aspect. 

Coupled-helix attenuators promise to perform these functions satis- 
factorily. 

A length of outer helix (synchronous with the inner helix) one half of a 
beat wavelength long, terminated at either end non-reflectively, forms a 
very simple, short, and elegant solution of the coupled-helix attenuator 
problem. A notable weakness of this form of attenuator is its relatively 
narrow bandwidth. Proceeding, as before, on the assumption that the 
attenuator is a fraction 6 larger or smaller than half a beat wavelength 
at frequencies w; and w2 on either side of the center frequency w, we find 
that the fraction of power transferred from the inner helix to the attenu- 
ator is then given by (1 — sin’ (76/2)). The attenuation is thus simply 


tet, Bae 2 16 
A = sin? (7) 


For helices of the same proportions as used before in Section 3.4.1, we 
find that this will give an attenuation of at least 20 db over a frequency 
band of two to one. At the center frequency, wo, the attenuation is in- 
finite; — in theory. 

Thus to get higher attenuation, it would be necessary to arrange for a 
sufficient number of such attenuators in tandem along the TWT. More- 
over, by properly staggering their lengths within certain ranges a wider 
attenuation band may be achieved. The success of such a scheme largely 
depends on the ability to terminate the helix ends non-reflectively. Con- 
siderable work has been done in this direction, but complete success is 
not yet in sight. 

Another basically different scheme for a coupled-helix attenuator rests 
on the use of distributed attenuation along the coupling helix. The diffi- 
culty with any such scheme lies in the fact that wnequal attenuation in 
the two coupled helices reduces the coupling between them and the more 
they differ in respect to attenuation, the less the coupling. Naturally, one 
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would wish to have as little attenuation as practicable associated with 
the inner helix (inside the TWT). This requires the attenuating element 
to be associated with the outer helix. Miller’ has shown that the maxi- 
mum total power reduction in coupled transmission systems is obtained 
when 

a, — a 


Bo 


where a: and a are the attenuation constants in the respective systems, 
and $8, the beat phase constant. If the inner helix is assumed to be loss- 
less, the attenuation constant of the outer helix has to be effectively equal 
to the beat wave phase constant. It turns out that 60 db of attenuation 
requires about 3 beat wavelengths (in practice 10 to 20 helix wave- 
lengths). The total length of a typical TWT is only 3 or 4 times that, 
and it will be seen, therefore, that this scheme may not be practical as 
the only means of providing loss. 

Experiments carried out with outer helices of various resistivities and 
thicknesses by K. M. Poole (then at the Clarendon Laboratory, Oxford, 
England) tend to confirm. this conclusion. P. D. Lacy” has described a 
coupled helix attenuator which uses a multifilar helix of resistance 
material together with a resistive sheath between the helices. 

Experiments were performed at Bell Telephone Laboratories with a 
TWT using a resistive sheath (graphite on paper) placed between the 
outer helix and the quartz tube enclosing the inner helix. The attenua- 
tions were found to be somewhat less than estimated theoretically. The 
attenuator helix was movable in the axial direction and it was instructive 
to observe the influence of attenuator position on the power output from 
the tube, particularly at the highest attainable power level. As one might 
expect, as the power level is raised, the attenuator has to be moved nearer 
to the input end of the tube in order to obtain maximum gain and power 
output. In the limit, the attenuator helix has to be placed right close to 
the input end, a position which does not coincide with that for maximum 
low-level signal gain. Thus, the potential usefulness of the feature of 
mobility of coupled-helix elements has been demonstrated. 


~ 1 








4, CONCLUSION 


In this paper we have made an attempt to develop and collect together 
a considerable body of information, partly in the form of equations, 
partly in the form of graphs, which should be of some help to workers 
in the field of microwave tubes and devices. Because of the crudity of the 
assumptions, precise agreement between theory and experiment has not 
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been attained nor can it be expected. Nevertheless, the kind of physical 
phenomena occurring with coupled helices are, at least, qualitatively 
described here and should permit one to develop and construct various 
types of devices with fair chance of success. 
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APPENDIX I 
I. SOLUTION OF FIELD EQUATIONS 


In this section there is presented the field equations for a transmission 
system consisting of two helices aligned with a common axis. The propa- 
gation properties and impedance of such a transmission system are dis- 
cussed for various ratios of the outer helix radius to the inner helix radius. 
This system is capable of propagating two modes and as previously 
pointed out one mode is characterized by a longitudinal field midway 
between the two helices and the other is characterized by a transverse 
field midway between the two helices. 

The model which is to be treated and shown in Fig. 2.3 consists of an 
inner helix of radius a and pitch angle y; which is coaxial with the outer 
helix of radius b and pitch angle ye. The sheath helix model will be 
treated, wherein it is assumed that helices consist of infinitely thin sheaths 
which allow for current flow only in the direction of the pitch angle y. 

The components of the field in the region inside the inner helix, be- 
tween the two helices and outside the outer helix can be written as 
follows — inside the inner helix 


H., = Bilo(yr) (1) 
E,, = Belo(yr) (2) 
Hy, = J = Beli (yr) (3) 
H,, = 2 Bilan) (4) 
Be = —J— Br) (5) 


B,, = 2 Bebo) (6) 
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and between the two helices 
B3I o(yr) i B.Ko(yr) 
BsIo(yr) + B.Kolyr) 


Hi, 


H¢, 


I 


I 


me [Bsli(yr) — BeKi(yr)] 


. [Bsli(yr) — BiKilyr)] 


i8 


and outside the outer helix 


H 
i 


Ry 
28 


Hes 


Hy, 


Le, 


j = [Bsli(yr) — BsKy(yr)] 


a [Bsli(yr) — BeKi(yr)] 


= B,Ko(yr) 
= BsKo(yr) 


~j = BsKi(yr) 





aa B:Ki(yr) 
Y 


j a B,Ka(yr) 


ll 


— IB BK yr) 
Y 
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(7) 
(8) 


(9) 


(10) 


(11) 


(12) 


(13) 
(14) 


(15) 


(16) 


(17) - 


(18) 


With the sheath helix model of current flow only in the direction of wires 
we can specify the usual boundary conditions that at the inner and outer 
helix radius the tangential electric field must be continuous and per- 
pendicular to the wires, whereas the tangential component of magnetic 
field parallel to the current flow must be continuous. These can be written 


as 


E,sny + E, cosy = 0 


E,, E, and (H, siny + H, cosy) be equal on either side of the helix. 
By applying these conditions to the two helices the following equations 
are obtained for the various coefficients. 


(19) 
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First, we will define a more simple set of parameters. We will denote 
In(ya) by Tox and Io(yb) by Lo ; cte. 


Further let us use the notation introduced by Humphrey, Kite and 
James" in his treatment of coaxial helices. 


Pa = InkKo Po = InKoe R=. olX oe 


(20) 
Py = InKu Py = Inky» Ry = Ink. 
and define a common factor (C.F.) by the equation 
2 2 
ore- ees PoPos — {Bot cot Ha)’ cot bs p 2, 
(ya)? (ya)? cot Yi (21) 


+ Ro — PuPa| 


With all of this we can now write for the coefficients of equations 1 
through 18: 











Bt ee ae [eae ~ org | 2 
Be wa Le eae Sey " 


Be To | Se cot ye)” 


C¥. (ya)? 


B seer a 0) Pp - Pa| (26) 








Br. é Bot coty 1 In _ cot Ye | 
2 J / we OP: | Pek Sotwis Prolkty (27) 
Bs _ (Boa cot yi)” cot ye In’ | _ cot pe | 
B.~ GaP cotvi CFR | ~ cot oe 


The last equation necessary for the solution of our field problem is the 
transcendental equation for the propagation constant, y, which can be 


COUPLED HELICES 


written 


= (Bo a cot Ws)" cot Yo | 
E (ya)? cot Ry 


= | Pos — 





(ya)? 


(Bo a cot Yr)" P| | Pa ay 
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(29) 


(Boa cot yn)” Pas| 


(ya)? 


The solutions of this equation are plotted in Fig. 4.1. 

There it is seen that there are two values of y, one, y: , denoting the 
slow mode with transverse fields between helices and the other, y., 
denoting the fast mode with longitudinal fields midway between the two 


helices. 


4.0 


ea. Vee 











Tam 
SEE 


N 
‘\ 


XN \ 
N 





En 
ae 
\ | 
S| 
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Fig. 4.1.1— The radial propagation constants associated with the transverse 
and longitudinal modes on coupled coaxial sheath helices given as a function of 


Boa cot y: for several values of b/a = 1.25. 
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These equations can now be used to compute the power flow as defined 
by 








P=\%Re | BX H*dA (30) 
which can be written in the form 
E2(0)} ; 
| =O) = (2) F(ya, yb) (31) 
where 
(Boa cot wy 7 
T 2 NO FEE 
_ (ya)? Io? (r+ Gor " Iuln)\(OFY 
-s_ (ya) In Jot’ = Sort )(0.F.)? 
[F (va, yb)] 240 (C.F) ee (es cot By 
01 411 ya 


(Boa cot yx)” ‘\( Ry - (Boa cot x1)” cot Ye y 
(ya)? me ee (ya? cot ms 


1(¢) CFs enw ee ee Inte) | 


(Boa cot yr)" cot Y2 .) ( _ Boa cot yo)” ) 
‘ (2 - Gar cota) VV Gay * 


~ (Ke! + 


2 
(2) (Qik 12 + Toke + Leek 02) — (22k + Loa + I ape (82) 


[ie ghee Meat 


2 
(2) (Rake key Geka Ku!) | 


(Boa cot yx)" (2) | 2 , (Boa cot We)” 7 2 "| 
a (yay KieRe \a Ro + (ae Tov Ki 


| Pelt =a ” ve 
cot Wy 





P. fe | [KooKos — Kis’ 


In (32) we find the power in the transverse mode by using values of 
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Fig. 4.1.2 — The radial propagation constants associated with the transverse 
and longitudinal modes on coupled coaxial sheath helices given as a function of 
Boa cot ¥, when b/a = 1.50. 


7: obtained from (29) and similarly the power in the longitudinal mode is 
found by using values of yz. 


Il. FINDING 7 


When coaxial helices are used in a transverse field amplifier, only the 
transverse field mode is of interest and it is important that the helix 
parameters be adjusted such that there is no longitudinal field at some 
radius, 7, where the cylindrical electron beam will be located. This condi- 
tion can be expressed by equating FH, to zero at r = fF and from (8) 


Belor) + BeKo(yr) = 0 (33) 
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which can be written with (25) and (26) as 


__ (Boa cot 1)” cot pe : 
Kee | fe (yay? Cotd; Ra | Toy?) 





34 
(Boa cot wo)’ en) 


(ya)? 


This equation together with (29) enables one to evaluate 7/a versus ya 
cot y for various ratios of b/a and cot ¥2/cot ¥1. The results of these 
calculations are shown in Fig. 3.1. 


= Iy | Pe = P| Ko(y7r) 
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Tig. 4.1.3 — The radial propagation constants associated with the transverse 
and longitudinal modes on coupled coaxial sheath helices given as a function of 
Boa cot Yi when b/a = 1.75. 
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Fig. 4.1.4 — The radial propagation constants associated with the transverse 
and longitudinal modes on coupled coaxial sheath helices given as a function of 
Boa cot y; when b/a = 2.0. 


III. COMPLETE POWER TRANSFER 


For coupled helix applications we require the coupled helix parame- 
ters to be adjusted so that RF power fed into one helix alone will set up 
the transverse and longitudinal modes equal in amplitude. For this 
condition the power from the outer helix will transfer completely to the 
inner helix. The total current density can be written as the sum of the 
current in the longitudinal mode and the transverse mode. Thus for the 
inner helix we have 


Ja = Jae BY! A Tqe7B# (35) 
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Fig. 4.1.5 — The radial propagation constants associated with the transverse 
and longitudinal modes on coupled coaxial sheath helices given as a function of 
Boa cot ¥: when b/a = 3.0. 


and for the outer helix 


Jn = Ince?" + Iie PY (36) 
For complete power transfer we ask that 
Joe = Joi 
_ when J, is zero at the input (2 = 0) 
Jat = —Jat 
or 
Le ae (37) 


Jat Jat 
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Now Jaz is equal to the discontinuity in the tangential component of 
magnetic field which can be written at r = a 
Jat = (Hs cos 1 — Hes sin) — (Ha cosy — Hy sin tn) 


which ean be written as 


Jat = —(Ha — H.s)qe(cot fr + tan yx) sin ya (38) 
and similarily at r = b 
Joe = —(Haz — H,3)t(cot wo + tan ye) sin Yo (39) 


Equations (88) and (89) can be combined with (37) to give as the condi- 
tion for complete power transfer 


Ar;= —A; Z (40) 


where 


t 2 
Ug Tak) (Pa - ware Pa) 
to i ame 
(Lou + LnkKo1) (ze = =e Sot . Ra) 

In (40) Az is obtained by substituting y,; into (41) and A, is obtained by 
substituting y; into (41). 

The value of cot ¥2/cot ¥; necessary to satisfy (40) is plotted in Fig. 
2.8. 

In addition to cot Y2/cot y1 it is necessary to determine the interference 
wavelength on the helices and this can be readily evaluated by consider- 
ing (86) which can now be written 


J, = Ile e* a a 





or 
Ty = Suq@7 OBO ogg (Br 5 B.) F (48) 

and 
yee je ON cos 4opyz (49) 


where we have defined 


Boa = (ya — 0) (50) 
This value of 6 is plotted versus oa cot Yr in Fig. 2.4. 
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Statistical Techniques for Reducing the 
Experiment Time in Reliability Studies 


By MILTON SOBEL 
(Manuscript received September 19, 1955) 


Given two or more processes, the units from which fail in accordance with 
an exponential or delayed exponential law, the problem 7s to select the partic- 
ular process with the smallest failure rate. It is assumed that there ts a com- 
mon guarantee period of zero or positive duration during which no failures 
occur. This guarantee period may be known or unknown. It is desired to 
accomplish the above goal in as short a time as possible without invalidating 
certain predetermined probability specifications. Three statistical techniques 
are considered for reducing the average experiment time needed to reach a 
decision. 

1. One technique is to increase the initial number of units put on test. 
This technique will substantially shorten the average experiment time. Its 
effect on the probability of a correct selection is generally negligible and in 
some cases there is no effect. 

2. Another technique is to replace each failure immediately by a new 
unit from the same process. This replacement technique adds to the book- 
keeping of the test, but if any of the population variances is large (say in 
comparison with the guarantee period) then this technique will result in a 
substantial saving in the average experiment time. 

8. A third technique is to use an appropriate sequential procedure. In 
many problems the sequential procedure results in a smaller average expert- 
ment time than the best non-sequential procedure regardless of the true 
fatlure rates. The amount of saving depends principally on the ‘distance’ 
between the smallest and second smallest failure rates. 

For the special case of two processes, tables are given to show the proba- 
belety of a correct selection and the average experiment time for each of three 
types of procedures. 

Numerical estimates of the relative efficiency of the procedures are given 
by computing the ratio of the average experiment time for two procedures of 
different type with the same initial sample size and satisfying the same 
probability spectfication. ‘ . 
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INTRODUCTION 


This paper is concerned with a study of the advantages and disad- 
vantages of three statistical techniques for reducing the average dura- 
tion of life tests. These techniques are: 

1. Increasing the initial number of units on test. 

2. Using a replacement technique. 

3. Using a sequential procedure. 

To show the advantages of each of these techniques, we shall consider 
the problem of deciding which of two processes has the smaller failure 
rate. Three different types of procedures for making this decision will 
be considered. They are: 

Ri, A nonsequential, nonreplacement type of procedure 
R., A nonsequential, replacement type of procedure 
R;, A sequential, replacement type of procedure 

Within each type we will consider different values of n, the initial 
number of units on test for each process. The effect of replacement is 
shown by comparing the average experiment time for procedures of 
type 1 and 2 with the same value of nm and comparable probabilities of a 
correct selection. The effect of using a sequential rule is shown by com- 
paring the average experiment time for procedures of type 2 and 3 with 
the same value of n and comparable probabilities of a correct selection. 


ASSUMPTIONS 


‘1. It is assumed that failure is clearly defined and that failures are 
recognized without any chance of error. 

2. The lifetime of individual units from either population i is assumed 
to follow an exponential density of the form 


1 —(@-—s)/0 
f(z3 0,9) =-e** forz 2g 
e (1) 


f(z; 6,9) = 0 fora <g 


where the location parameter g = 0 represents the common guarantee 
period and the scale parameter @ > 0 represents the unknown parameter 
which distinguishes the two different processes. Let 6, = 62 denote the 


ordered values of the unknown parameter @ for the two processes; then 
the ordered failure rates are given by 


= 1/( + 9) Ss = 1/(& + g) (2) 
3. It is not known which process has the parameter 6; and which has 


the parameter @ . 
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4, The parameter g is assumed to be the same for both processes. It 
may be known or unknown. 

5. The initial number n of units put on test is the same for both pro- 
cesses. 

6. All units have independent lifetimes, i.e., the test environment is 
not such that the failure of one unit results in the failure of other units 
on test. 

7. Replacements used in the test are assumed to come from the same 
population as the units they replace. If the replacement units have to 
sit on a shelf before being used then it is assumed that the replacements 
are not affected by shelf-aging. 


CONCLUSIONS 


1. Increasing the initial sample size n has at most a negligible effect 
on the probability of a correct selection. It has a substantial effect on the © 
average experiment time for all three types of procedures. If the value of 
n is doubled, then the average time is reduced to a value less than or 
equal to half of its original value. 

2. The technique of replacement always reduces the average experi- 
ment time. This reduction is substantial when g = 0 or when the popu- 
lation variance of either process is large compared to the value of g. 
This decrease in average experiment time must always be weighed against 
the disadvantage of an increase in bookkeeping and the necessity of 
having the replacement units available for use. 

3. The sequential procedure enables the experimenter to make rational 
decisions as the evidence builds up without waiting for a predetermined 
number of failures. It has a shorter average experiment time than non- 
sequential procedures satisfying the same specification. This reduction 
brought about by the sequential procedure increases as the ratio a of 
the two failure rates increases. In addition the sequential procedure 
always terminates with a decision that is clearly convincing on the basis 
of the observed results, i.e., the 4 posteriori probability of a correct 
selection is always large at the termination of the experiment. 


SPECIFICATION OF THE TEST 


Each of the three types of procedures is set up so as to satisfy the 
same specification described below. Let a denote the true value of the 
ratio 0,/@, which by definition must be greater than, or equal to, one. 
It turns out that in each type of procedure the probability of a correct 
selection depends on 6; and 6, only through their ratio a. 


182 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 


1. The experimenter is asked to specify the smallest value of a (say 
it isa* > 1) that is worth detecting. Then the interval (1, a*) represents 
a zone of indifference such that if the true ratio a lies therein then we 
would still like to make a correct selection, but the loss due to a wrong 
selection in this case is negligible. 

2. The experimenter is also asked to specify the minimum value P* > 
1g that he desires for the probability of a correct selection whenever 
a 2 a*. In each type of procedure the rules are set up so that the proba- 
bility of a correct selection for a = a* is as close to P* as possible without 
being less than P*, 

The two constants a* > 1 and 4 <:-P* < 1 are the only quantities 
specified by the experimenter. Together they make up the specification 
of the test procedure. , 


EFFICIENCY 


If two procedures of different type have the same value of n and satisfy 
the same specification then we shall regard them as comparable and 
their relative efficiency will be measured by the ratio of their average 
experiment times. This ratio is a function of the true e but we shall 
consider it only for selected values of a, namely, a = 1, a = a* and 


a= ©, 


PROCEDURES OF TYPE Ri; — NONSEQUENTIAL, NONREPLACEMENT 


“The same number n of units are put on test for each of the two pro- 
cesses. Experimentation is continued until either one of the two samples 
produces a predetermined number r (r S n) of failures. Experimenta- 
tion is then stopped and the process with fewer than r failures is chosen 
to be the better one.” . 


TABLE J — PROBABILITY OF A CORRECT SELECTION — PROCEDURE 
Tyre R, 


(a = 2, any g = QO, to be used to obtain r for a* = 2) 








n r=1 r=2 r= 3 r=4 
1 0.667 — — — 
2 0.667 0.733 — _— 
3 0.667 0.738 0.774 — 
4 0.667 0.739 0.784 0.802 
10 0.667 0.741 0.789 0.825 
20 0.667 0.741 0.790 0.826 
oo 0.667 0.741 0.790 0.827 








Note: The value for r = 0 is obviously 0.500 for any n. 
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We shall assume that the number v of units put on test is determined 
by non-statistical considerations such as the availability of units, the 
availability of sockets, etc. Then the only unspecified number in the 
above procedure is the integer r. This can be determined from a table 
of probabilities of a correct selection to satisfy any given specification 
(a*, P*). If, for example, a* = 2 then we can enter Table I. If n is 
given to be 4 and we wish to meet the specification a* = 2, P* = 0.800 
then we would enter Table I with n = 4 and select r = 4, it being the 
smallest value for which P = P*. 

The table above shows that for the given specification we would also. 
have selected r = 4 for any value of n. In fact, we note that the proba- 
bility of a correct selection depends only slightly on n. The given value 
of n and the selected value of r then determine a particular procedure 
of type Ri, say, Ri(n, 7). 

The average experiment time for each of several procedures Ri(n, 1) 
is given in Table II for the three critical values of the true ratio a, 
namely, a = 1a = a* anda = o. Each of the entries has to be multi- 
plied by 62, the smaller of the two 6 values, and added to the common 
guarantee period g. For n = © the entry should be zero (+g) but it 
was found convenient to put in place of zero the leading term in the 
asymptotic expansion of the expectation in powers of 1/n. Hence the 
entry for n = © can be used for any large n, say, n = 25 whenr S 4. 

We note in Table II the undesirable feature that for each procedure 
the average experiment time increases with a for fixed 6,. For the se- 
quential procedure we shall see later that the average experiment time 
is greater at a = a* than at either a = 1 ora = o. This is intuitively 
more desirable since it means that the procedure spends more time when 
the choice is more difficult to make and less time when we are indifferent 
or when the choice is easy to make. 


PROCEDURES OF TYPE R2-— NONSEQUENTIAL, REPLACEMENT 


‘Such procedures are carried out exactly as for procedures of Ry except 
that failures are immediately replaced by new units from the same 
population.” 

To determine the appropriate value of r for the specification e* = 2, 
P* = 0.800 when g = O we use the last row of Table I, i.e., the row 
marked n = ©, and select r = 4. The probability of a correct selection 
for procedures of type Re is exactly the same for all values of n and de- 
pends only on r. Furthermore, it agrees with the probability for pro- 
cedures of type Ri with n = © so that it is not necessary to prepare a 
separate table. 


Taste JI— Average Experiment Time — Procepure Tyre Rk, 
(Multiply entry by 6. and add g) 


0.500 0.667 1.000 — = — 
0.250 0.333 0.500 0.917 1.200 1.500 _ _ — 
0.167 0.222 0.333 0.517 0.675 0.833 1.217 1.572 1.833 


0.125 0.167 0.250 0.363 0.474 0.583 0.735 0.944 1.083 1.449 1.854 2.083 
0.050 0.067 0.100 0.132 0.172 0.211 0.231 0.297 0.336 0.347 0.439 0.479 
0.025 0.033 0.050 0.064 0.084 0.103 0.109 0.139 0.158 0.157 0.200 0.217 











0.500/n | 0.667/n | 1.000/n | 1.250/n | 1.630/n | 2.000/n | 2.063/n | 2.642/n | 3.000/n | 2.906/n | 3.669/n | 4.000/n 
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TasuiE III — Vantur or r REQUIRED TO MEET THE SPECIFICATION 















































(a*, P*) ror ProcepurEs oF Typrk Re (g = 0) 

P* 

1.05 | 1.10 | 145 | 1.20 | 1.25 | 1.30 | 1.35 | 1.40 | 1.45 | 1.50 | 2.00 | 2.50 | 3.00 
0.50 0 0 0 0 0 0 0; O; O| O}| O} O} O 
0.55 14 4 2 2 1 1 1 1 1 1 1 1 1 
0.60 55 15 7 5 3 3 2; 2) 2 1 1 1 1 
0.65 126 33 16 10 7 5 4/1 3] 3] 3 1 1 1 
0.70 232 61 29 17 12 9 71! 6) 54) 4) 2 1 1 
0.75 383 101 47 28 19 14 11 9| 7 6; 38] 2 1 
0.80 596 157 73 43 29 21 17 |; 13) 11 9) 4] 2 2 
0.85 903 | 238 | 111 65 44 32 25|}20/16/14; 5] 3 3 
0.90 | 1881 363 | 169 100 67 49 37 | 30 | 25 | 21 8 5 4 
0.95 | 2274 | 597 | 278 164 110 80 61 | 49 | 40} 384 | 12} 7] 5 
0.99 ) 4549 } 11938 } 556 | 327 | 219 } 160 | 122 ) 98 | 80} 68 | 24 | 14 } 10 




















It is also unnecessary to prepare a separate table for the average ex- 
periment time for procedures of type RF. since for g = 0 the exact values 
can be obtained by substituting the appropriate value of n in the ex- 
pressions appearing in Table IT in the row marked n = ©. For example, 
forn = 2,r = 1 anda = 1 the exact value for g = 0 is 0.500 @:/2 = 
0.250 6, and for n = 3,r = 4,a = © the exact value for g = Ois 
4.000 62/3 = 1.333 62. It should be noted that for procedures of type Rz 
we need not restrict our attention to the cases r < n but can also con- 
sider r > n. 

Table III shows the value of r required to meet the specification 
(a*, P*) with a procedure of type R» for various selected values of a* 
and P*. 


PROCEDURES OF TYPE R3— SEQUENTIAL, REPLACEMENT 


Let D(é) denote the absolute difference between the number of fail- 
ures produced by the two processes at any time ¢. The sequential pro- 
cedure is as follows: 

“Stop the test as soon as the inequality 


In [P*/(1 — P*)] 
In a* 


Di) 2 (3) 
is satisfied. Then select the population with the smaller number of fail- 
ures as the better one.” 

To get the best results we will choose (a*, P*) so that the right hand 
member of the inequality (8) is an integer. Otherwise we would be operat- 
ing with a higher value of P* (or a smaller value of a*) than was specified. 
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TabLte ITV— Average Exprriment Time AND PROBABILITY OF A 
Corricr SELECTION — Procepure Typr Pz 
(a* = 2, P* = 0.800, g = 0) 
(Multiply each average time entry by 62) 











n a=t1 a=2 a= 0 
1 2.000 2.400 2.000 
2 1.000 1.200 1.000 
3 0.667 0.800 0.667 
4 0.500 0.600 0.500 
10 0.200 0.240 0.200 
20 0.100 0.120 0.100 
eo 2.000/n 2.400/n 2.000/n 
Probability. .......... 0.500 0.800 1.000 


For example, we might choose a* = 2 and P* = 0.800. For procedures 
of type R; the probability of a correct selection is again completely in- 
dependent of n; here it depends only on the true value of the ratio a. 
The average experiment time depends strongly on n and only to a limited 
extent on the true value of the ratio a. Table IV gives these quantities 
fora = 1, a = 2, and a = o for the particular specification a* = 2, 
P* = 0.800 and for the particular value g = 0. 


EFFICIENCY 


We are now in a position to compare the efficiency of two different 
types of procedures using the same value of n. The efficiency of R, rcla- 
tive to fz is the reciprocal of the ratio of their average experiment time. 
This is given in Table V for a* = 2, P* = 0.800, r = 4andn = 4, 10, 20 
and o. By Table I the value P* = 0.800 is not attained for n < 4. 

In comparing the sequential and the nonsequential procedures it was 
found that the slight excesses in the last column of Table I over 0.800 


TABLE V — Erriciency oF Type R,; RELATIVE TO 
Tyre Ro 


(a* = 2, P* = 0.800,r = 4,9 = 0) 


n a=1 a=2 a=o 
4 0.501 0.495 0.480 
10 0.837 0.836 0.835 
20 0.925 0.917 0.922 
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TaBLeE VI — Erriciency or ApJusTED PR, ReLative To R; 
(a* = 2, P* = 0.800, g = 0) 





n a=1 a=2 a= o 
4 0.615 0.575 0.419 
10 0.754 0.708 0.528 
20 0.818 0.768 0.573 
roe 0.873 0.822 0.612 


had an effect on the efficiency. To make the procedures more comparable 
the values for r = 3 and r = 4 in Table I were averaged with values p 
and 1 — p computed so as to give a probability of exactly 0.800 at a = a*, 
The corresponding values for the average experiment time were then 
averaged with the same values p and 1 — p. The nonsequential pro- 
cedures so altered will be called ‘‘adjusted procedures.” The efficiency 
of the adjusted R, relative to R; is given in Table VI. 

In Table VI the last row gives the efficiency of the adjusted procedure 
R: relative to R;. Thus we can separate out the advantage due to 
the replacement feature and the advantage due to the sequential fea- 
ture. Table VII gives these results in terms of percentage reduction of 
average experiment time. 

We note that the reduction due to the replacement feature alone is 
greatest for small 7 and essentially constant with a while the reduction 


TasBLE VII — Per Cent Repucrion IN AVERAGE EXPERIMENT TIME 
DUE TO STATISTICAL TECHNIQUES 


(a* = 2, P* = 0.800, g = 0) 

















Reduction 
Reduction due to Reduction due to due to both 
a n Replacement Sequential Replacement 
Feature Alone Feature Alone and Sequential 
Features 
4 29.5 12.7 38.5 
1 10 13.7 12.7 24.6 
20 6.3 12.7 18.2 
0 0.0 12.7 12.7 
4 30.1 17.8 42.5 
2 10 13.9 17.8 29.2 
20 6.6 17.8 23.2 
eo 0.0 17.8 17.8 
4 31.5 38.8 58.1 
o 10 13.6 38.8 47.2 
20 6.3 38.8 42.7 
20 0.0 38.8 38.8 
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due to the sequential feature alone is greatest for large a and is inde- 
pendent of n. Hence if the initial sample size per process n is large we 
can disregard the replacement technique. On the other hand the true 
value of a is not known and hence the advantage of sequential experi- 
mentation should not be disregarded. 

The formulas used to compute the accompanying tables are given in 
Addendum 2. 
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ADDENDUM. 1 


In this addendum we shall consider the more general problem of select- 
ing the best of k exponential populations treated on a higher mathemati- 
cal level. For k = 2 this reduces to the problem discussed above. 


DEFINITIONS AND ASSUMPTIONS 


There are given k populations T; (¢ = 1, 2, --- , k) such that the life- 
times of units taken from any of these populations are independent 
chance variables with the exponential density (1) with a common (known 
or unknown) location parameter g 2 0. The distributions for the k popu- 
lations are identical except for the unknown scale parameter 6 > 0 which 
may be different for the k different populations. We shall consider three 
different cases with regard to g. 

Case 1: The parameter g has the value zero (g = 0). 

Case 2: The parameter g has a positive, known value (g > 0). 

Case 3: The parameter g is unknown (g 2 0). 

Let the ordered values of the k scale parameters be denoted by 


A2R2--- 2 (4) 


where equal values may be regarded as ordered in any arbitrary manner. 
At any time ¢ each population has a certain number of failures associated 
with it. Let the ordered values of these integers be denoted by r; = 7;,(t) 
so that 


Ni Sy Sass iE (5) 
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For each unit the life beyond its guarantee period will be referred to 
as its Poisson life. Let L,(#) denote the total amount of Poisson life 


observed up to time ¢ in the population with r; failures (¢ = 1,2, --- ,<). 
If two or more of the r; are equal, say r; = Tiga = +++ = Ti4;, then we 
shall assign r; and L; to the population with the largest Poisson life, 
riga and L;41 to the population with the next largest, --- , ri; and Lis; 


to the population with the smallest Poisson life. If there are two or more 
equal pairs (r;, L;) then these should be ordered by a random device 
giving equal probability to each ordering. Then the subscripts in (5) as 
well as those in (4) are in one-to-one correspondence with the k given 
populations. It should be noted that Z;(é) 2 0 for all 7 and any time 
¢t 2 0. The complete set of quantities L(t) (¢ = 1, 2,--- , &) need not 
be ordered. Let a = 6;/62 so that, since the 6; are ordered, a = 1. 

We shall further assume that: 

1. The initial number n of units put on test is the same and the start- 
ing time is the same for each of the k populations. 

2. Each replacement is assumed to be a new unit from the same popu- 
lation as the failure that it replaces. 

3. Failures are assumed to be clearly recognizable without any chance 
of error. 


SPECIFICATIONS FOR CASE 1: g = 0 


Before experimentation starts the experimenter is asked to specify two 
constants a* and P* such that a* > land 14 < P* < 1. The procedure 
Rs = R3(n), which is defined in terms of the specified a* and P*, has 
the property that it will correctly select the population with the largest 
scale parameter with probability at least P* whenever a = a*. The initial 
number n of units put on test may either be fixed by nonstatistical con- 
siderations or may be determined by placing some restriction on the 
average experiment time function. 


Rule R3: 


‘‘Continue experimentation with replacement until the inequality 


k 


Der Sl See (6) 


i=2 


is satisfied. Then stop and select the population with the smallest num- 
ber of failures as the one having the largest scale parameter.” 
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Remarks 


1. Since P* > 1% then (1 — P*)/P* < 1 and hence no two popula- 
tions can have the same value 7; at stopping time. 

2. For k = 2 the inequality (6) reduces to the inequality (3). 

3. The procedure R; terminates only at a failure time, never between 
failures, since the left member of (6) depends on ¢ only through the 
quantities 7;(¢). 

4. After experimentation is completed one can make, at the 100P per 
cent confidence level, the confidence statement 


6, s A, s a* 6, (or 61/a* < 05 S 61) (7) 


where @, is the scale parameter of the selected population. 


Numerical Illustrations 


Suppose the preassigned constants are P* = 0.95 and a* = 194 = 
2.088 so that (1 — P*)/P* = Vg. Then for k = 2 the procedure is to 
stop when r2 — 71 2 4. For k = 3 it is easy to check that the procedure 
reduces to the simple form: “Stop when rz — 7, 2 5”. For k > 3 either 
calculations can be carried out as experimentation progresses or a table 
of stopping values can be constructed before experimentation starts. 
For k = 4 and k = 5 see Table VIII. 


In the above form the proposed rule is to stop when, for at least one 


Tasie VIII — SrquentiaL Rutp ror P* = 0.95, a* = 19% 

































































k=4 k=5 

non | non | an—n | won | 
5 5 9 10 
5 5 10 10s 
5 6 6 8 
5 6 7 7 
5 7 7 7 * 
6 6 6 6 











* Starred rows can be omitted without affecting the test since every integer in 
these rows is at least as great as the corresponding integer in the previous row. 
They are shown here to illustrate a systematic method which insures that all the 
necessary rows are included. 
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row (say row j) in the table, the observed row vector (rz — 71, 
73 — M1, °**, 7: — 71) 1s such that each component is at least as large as 
the corresponding component of row j. 


Properties of R3 fork = 2andg = 0 


For k = 2 and g = 0 the procedure R; is an example of a Sequential 
Probability Ratio test as defined by A. Wald in his book.’ The Average 
Sample Number (ASN) function and the Operating Characteristics (OC) 
function for 23 can be obtained from the general formulae given by 
Wald. Both of these functions depend on 6; and 62 only through their 
ratio a. In our problem there is no excess over the boundary and hence 
Wald’s approximation formulas are exact. When our problem is put into 
the Wald framework, the symmetry of our problem implies equal proba- 
bilities of type 1 and type 2 errors. The OC function takes on comple- 
mentary values for any point a = 6,/62 and its reciprocal 62/6; . We shall 
therefore compute it only for a = 1 and denote it by P(a). For a > 1 
the quantity P(a@) denotes the probability of a correct selection for the 
true ratio a. 

The equation determining Wald’s h function’ is 











*\h *\—h ; 
(a*)" , ala) (8) 
lt+a l+ea 
for which the non-zero solution in h is easily computed to be 
Ine 


Hence we obtain from Wald’s formula (8:48) in Reference 5 


3 


Ele) ey (10) 
where s is the smallest integer greater than or equal to 
S = In [P*/(1 — P*)]/In a* (11) 
In particular, for « = 1°, a* and » we have 
P(*) = \s, yea Ca Ra a P(o) = 1 (12) 


We have written P(1*) above for lim P(x) as x > 1 from the right. The 
procedure becomes more efficient if we choose P and a* so that S is an 
integer. Then s = S and P(a*) = P*. 

Letting F’ denote the total number of observed failures required to 
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terminate the experiment we obtain for the ASN function 


7, = at+l1 a —1 
ECF; a) = s(2+ ') (= :) fora > 1 (18) 








and, in particular, fora = 1, © 


E(F;1)=s and HE(F; ©) =s (14) 
It is interesting to note that for s = 1 we obtain 
E(F;a) =1 foralla 21 (15) 


and that this result is exact since for s = 1 the right-hand: member S 
of (8) is at most one and hence the procedure terminates with certainty 
immediately after the first failure. 

As a result of the exponential assumption, the assumption of replace- 
ment and the assumption that g = 0 it follows that the intervals between 
failures are independently and identically distributed. For a single popu- 
lation the time interval between failures is an exponential chance vari- 
able. Hence, for two populations, the time interval is the minimum of 
two exponentials which is again exponential. Letting + denote the 
(chance) duration of a typical interval and letting 7 denote the (chance) 
total time needed to terminate the procedure, we have 


BOD a) = ECF; ed Ble; a, 0) = BOR; 0) (%)(s-2—) a6) 


Hence we obtain from (13) and (14) 





; _ 6 sa a —1 
ED Oy SS ay cea fora >1 (17) 
628° 628 
E(T; 1, 6.) == and ECT; ©, 6.) = — (18) 
2n n 


For the numerical illustration treated above with k = 2 we have 








4 
a 
P(a) = ae (19) 
Pt) =%;  P(2.088) = 0.95; P(o) =1 (20) 
up. .getie —-1_,@+1%@' +1) 
De See ae 8 ee oe 


E(F;1) = 16.0; E(F;2.088) = 10.2; HF; «) =4 (22) 
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6.96. 


E(T; 1, 62) = ; E(T; 2.088, 62) = aa 





86. 
an ‘ (23) 


n 


Tor k > 2 the proposed procedure is an application of a general se- 
quential rule for selecting the best of k populations which is treated in 
[1]. Proof that the probability specification is met and bounds on the 
probability of a correct decision can be found there. 


CASE 2: COMMON KNOWN g > 0 


In order to obtain the properties of the sequential procedure R; for 
this case it will be convenient to consider other sequential procedures. 
Let 6 = 1/4. — 1/6, so that, since the 6; are ordered, 8 = O. Let us 
assume that the experimenter can specify three constants a*, B* and 
P* such that a* > 1, B* > 0 and 4% < P* < 1 and a procedure is de- 
sired which will select the population with the largest scale parameter 
with probability at least P* whenever we have both 


aZza* and 6 = p* 


The following procedure meets this specification. 


Rule R;’: 
“Continue experimentation with replacement until the inequality 
‘ . 
Dy ok ETD BALL) <(1—P*)/P* (24) 
i=2 


is satisfied. Then stop and select the population with the smallest number 
of failures as the one having the largest scale parameter. If, at stopping 
time, two or more populations have the same value 7: then select that 
particular one of these with the largest Poisson life L1 .” 


Remarks 
1. For k = 2 the inequality reduces to 
(rz — 11) In a*® + (Ly — Le) B* = In [P*/(1 — P*)] (25) 


If g = O then Z, = JL; for all ¢ and the procedure R;’ reduces to R; . 
2. The procedure R;’ may terminate not only at failures but also be- 
tween failures. 
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3. The same inequality (24) can also be used if experimentation is 
carried on without replacement, one advantage of the latter being that 
there is less bookkeeping involved. In this case there is a possibility 
that the units will all fail before the inequality is satisfied so that the 
procedure is not yet completely defined for this case. One possibility 
in such a situation is to continue experimentation with new units from 
each population until the inequality is satisfied. Such a procedure will 
terminate in a finite time with probability one, i.e., Prob{7’ > 7} — 0 
as 7’) — ©, and the probability specification will be satisfied. 

4, A procedure R3’ (m1, m2, °°: , Mm, t,t, -°:, &) using the same 
inequality (24) but based on different initial sample sizes and/or on 
different starting times for the initial samples also satisfies the above 
probability specification. In the case of different starting times it is 
required that the experimenter wait at least g units of time after the last 
initial sample is put on test before reaching any decision. 

5. One disadvantage of R;’ is that there is some (however remote) 
possibility of terminating while 71 = r2. This can be avoided by adding 
the condition rz > 71 to (24) but, of course, the average experiment time 
is increased. Another way of avoiding this is to use the procedure R; 
which depends only on the number of failures; the effect of using R; 
when g > 0 will be considered below. 

6. The terms of the sum in (24) represent likelihood ratios. If at any 
time each term is less than unity then we shall regard the decision to 
select the population with 7; failures and LZ, units of Poisson life as opti- 
mal. Since (1 — P*)/P* < 1 then each term must be less than unity at 
termination. 


Properties of Procedure Rs’ fork = 2 


The OC and ASN functions for R;’ will be approximated by comparing 
R3’ with another procedure R;” defined below. We shall assume that P* 
is close to unity and that g is small enough (compared to 62) so that the 
probability of obtaining two failures within g units of time is small 
enough to be negligible. Then we can write approximately at termination 


L=nT-—rg G=1,2,-:-,k) (26) 
and ; 
TI — Lb: = (ri — rig (@ = 2,3,-°-,k) (27) 
Substituting this in (24) and letting 
d* = at P? (28) 


suggests a new rule, say R;”, which we now define. 
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Rule R3” 


“Continue experimentation with replacement until the inequality 


> oe <= = P*)/P* (29) 


i=2 
is satisfied. Then stop and select the population with 7; failures as the 
one with the largest scale parameter.” 


For rule R3” the experimenter need only specify P* and the smallest 
value 6* of the single parameter 


6 2)—- 
Ss a en (W)) p98 (30) 
2 


that he desires to detect with probability at least P*. 

We shall approximate the OC and ASN function of R;” for k = 2 
by computing them under the assumption that (27) holds at termina- 
tion. The results will be considered as an approximation for the OC and 
ASN functions respectively of R;’ for k = 2. The similarity of (29) 
and (6) immediately suggests that we might replace a* by 6* and a by 
5 in the formulae for (6). To use the resulting expressions for R;’ we 
would compute 6* as a function of a* and 6* by (28) and 6 as a function 
of a and 8 by (30). 

The similarity of (29) and (6) shows that Z, (defined in Reference 4, 
page 170) under (27) with g > 0 is the same function of 6* and 6 as it 
is of a* and a when g = 0. To complete the justification of the above 
result it is sufficient to show that the individual increment z of Z, is the 
same function of 5* and 6 under (27) with g > 0 as it is of a* and a 
when g = 0. To keep the increments independent it is necessary to as- 
sociate each failure with the Poisson life that follows rather than with 
the Poisson life that precedes the failure. Neglecting the probability 
that any two failures occur within g units of time we have two values for 
z, namely 


5 emt 9101 nt 182 

z = log + = —logé (31) 
n eo (nt9)102 nt 94 
Os 





and, interchanging @, and 6: , gives z = log 6. Moreover 
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[ fe N —(nz—4) [92,—nul0y dex dy 
Prob {z = —log 6} = 


92 —g{09(n—1) +010} /0,02 , 91 9 (89481 (n—1)]/0 82) 
—€ - € 
n + n (32) 
_ 6 
1+ 6 


Thus the OC and ASN functions under (27) with g > 0 bear the same 
relation to 6* and 6 as they do to a* and a when g = 0. Hence, letting 
w denote the smallest integer greater than or equal to 


_ In (P*/(. — P*)] _ In[P*/(1 — P*)] 
ie In 6* ~  gB* + Ina* 2 
we can write (omitting P* in the rule description) 
6” 


P{6; Rs (a*, B*)} = Pld; Rs” (8*)} & + 1 


(34) 


E{PF; Rs’ (a*, B*)} & ELF; Rs” (6*)} 


a uf tO al NE = ) for 6 >1 (35) 


w foré6 = 1 





We can approximate the average time between failures by 


rae eo NG OG A Be) af a ) 
Hrs 0,02, 9) % Te a oy S = ae ers ae (36) 


and the average experiment time by 





(g of Ad(g + 62) 
n(A, + 02 + 29) 87) 


Since 6 = 1 then 6°/(1 + 6”) is an increasing function of w and by 
(33) it is a non-increasing function of 6*. By (28) 6* 2 a* and hence, 
if we disregard the approximation (34), 


[P*/(l — P*)| /In a* 


Clearly the rules R3(a*, P*) and R;” (a*, P*) are equivalent so that 
for g > 0 we have 


E{T; Rs'(o*, B*)} & EP; Rs " a*, B*)} 


P{6; Rs(a*)} = P{6; Rs” (a*)} (39) 
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and hence, in particular, letting 6 = 6* in (38) we have 
P{5*; Rs(a*)} 2 P{s*; Rs” (5*)} 2 P* (40) 


since the right member of (34) reduces to P* when W is an integer and 
6 = 6*. The error in the approximations above can be disregarded when 
g is small compared to #2. Thus we have shown that for small values of 
g/62 the probability specification based on (a*, 6*, P*) is satisfied in the 
sense of (40) if we use the procedure R3(a*, P*), i.e., if we proceed as if 
g = 0. 

It would be desirable to show that we can proceed as if g = 0 for all 
values of g and P*. It can be shown that for sufficiently large n the rule 
R3(a*, P*) meets it specification for all g. One effect of increasing. n 
is to decrease the average time /(r) between failures and to approach 
the corresponding problem without replacement since g/EH(r) becomes 
large. Hence we need only show that R3(a*, P*) meets its specification 
for the corresponding problem without replacement. If we disregard the 
information furnished by Poisson life and rely solely on the counting of 
failures then the problem reduces to testing in a single binomial whether 
6 = 6, for population II, and 6 = 6 for population I. or vice versa. Let- 
ting p denote the probability that the next failure arises from I, then 
we have formally 


Ho:p 


versus Mp = 





eee eal 
lta 
For preassigned constants a* > 1 and P* (44 < P* < 1) the appropri- 
ate sequential likelihood test to meet the specification: 

‘“‘Probability of a Correct Selection 2 P* whenever a 2 a*” (41) 
then turns out to be precisely the procedure R3(a*, P*). Hence we may 
proceed as if g = 0 when nis sufficiently large. 

The specifications of the problem may be given in a different form. 
Suppose 6;* > 6o* are specified and it is desired to have a probability of a 
correct selection of at least P* whenever 6; = 0,;* > 6.* = 6.. Then we 
can form the following sequential likelihood procedure R;* which is 
more efficient than R3(a*, P*). 


Rule R3*: 


“Continue experimentation without replacement until a time ¢ is 
reached at which the inequality 


k t/8* —(ry—r1) * 
e€ —1 1—P 
dX | a — | Sp (42) 


i= 
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is satisfied. Then stop and select the population with r, failures as the 
population with 6 = 61’. 

It can be easily shown that the greatest lower bound of the bracketed 
quantity in (42) is 6:*/6.*. Hence for 6,*/@:* = a* and P* > 1% the time 
required by R;*(6:*, 62*, P*) will always be less than the time required 
by R3(a*, P*). 

Another type of problem is one in which we are given that @ = 6,* 
for one population and 6 = @,* for the & — 1 others where 6;* > 6.* are 
specified. The problem is to select the population with 6 = 0@,*. Then 
(42) can again be used. In this case the parameter space is discrete with 
k points only one of which is correct. If Rule R;* is used then the 
probability of selecting the correct point is at least P*. 


Equilibrium Approach When Failures Are Replaced 


Consider first the case in which all items on test are from the same 
exponential population with parameters (6, g). Let 7',; denote the length 

of the time interval between the 7" and the 7 + 18* failures, (7 = 0, 
"1, -+++), where n is the number of items on test and the 0" failure de- 
notes the starting time. As time increases to infinity the expected number 
of failures per unit time clearly approachesn/(@ + g) which is called the 
equilibrium failure rate. The inverse of this is the expected time between 
failures at equilibrium, say /(T,,,). The question as to how the quanti- 
ties H(T,,;) approach /(T,,,,) is of considerable interest in its own right. 
The following results hold for any fixed integer n 2 1 unless explicitly 
stated otherwise. It is easy to see that 











E(2m) Ss E(T,,.) S E(T 1) (43) 
since the exact values are respectively 
6 a) g + 6 @ 
1— < < “ 4 
n—1 ( n a ; a g+ n (44) 


In fact, since all units are new at starting time and since at the time of 
the first failure all units (except the replacement) have passed their 
guarantee period with probability one then 


If we compare the case g > O with the special case g = 0 we obtain 


sis 
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and if we compare it with the non-replacement case (g/@ is large) we 
obtain 
HG) (j=1,2,-++,2-1). (47) 
n— J 

These comparisons show that the difference in (46) is small when g/6 is 
small and for 7 < n the difference in (47) is small when g/@ is large. 

It is possible to compute /(T',;) exactly for g 2 0 but the computa- 
tion is extremely tedious for 7 2 2. The results for 7 = 1 and 0 are given 
in (44). Forj = 2 


BO Oe A) oe 


(48) 
+ n— 2 em Dald Tome ete) (n > 2) 
n—-1 n(n — 1) 
and 
HES apes [1 — 470 4 Pero) (49) 


For the case of two populations with a common guarantee period g 
we can write similar inequalities. We shall use different symbols a, b for 
the initial sample size from the populations with scale parameters 6; , 62 
respectively even though our principal interest is in the case a = b = n 
say. Let T..,; denote the interval between the j and 7 + 15* fail- 
ures in this case and let \; = 1/6; (¢ = 1, 2). We then have for all values 
of a and b 


[ada + bMS S E(Ta.s,3) S E(Ta3,0) 
=g+ [ai + dd” (Gf = 0,1,2,+-+, %) (50) 


(0. + g)(02 + g) 
a(O2 + g) + b(6: + g) on 


The result for E(T..,1) corresponding to that in (48) does not hold if 
the ratio 61/62 is too large; in particular it can be shown that 


—gl(a—1)Ay+bdA9] 
E(Tae.) = eS eee es | 
3 ayy + brd2/\(a — 1)A1 + Dre ani + bre 


Dro ) 1 ) Lp vee) 
tes 
oF (= + bAg. (= + (b = 1)r2 | an + bAs | 


is larger than E(T.,,,,) fora = b = 1 when g/@, = 0.01 and g/@ = 0.10 


ET 9,0) = 


(52) 
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so that 0,/@2 = 10. The expression (52) reduces to that in (44) if we set 
6; = 0. = 6 and replace a and b by n/2 in the resulting expression. 

Corresponding exact expressions for E(T..,;) for 7 > 1 are extremely 
tedious to derive and unwieldy although the integrations involved are 
elementary. If we let g — © then we obtain expressions for the non- 
replacement case which are relatively simple. They are best expressed 
as a recursion formula. 


aM 


E(Ta93) = ———— ET api 
na avi + bA fics 
1 2 a (53) 
2 . : > 
F Ce ET ap,j- pe y) 
an 1 
ET. ens ge 
( bi) ary; + br2 (a : 1)Ay + br» (54) 
2 Ss 
ae eeeg sve eres = PR 
E(T 09,3) S 9 + 4:/a forj 2aandj=0 (55) 
E(T 49,3) = 6:/(a —7) for 1 < j < a— | (56) 
Results similar to (55) and (56) hold for the case a = 0. The above 
results for g = © provide useful approximations for /(7Ta,»,;) when g 
is large. Upper bounds are given by 
E(Ta»,3) < [a + (6 — pr” (j = 1,2,---,b) (57) 
E(T 5, 5+) < [(a = pul (7 = i, 2, A ES 1). (58) 


Duration of the Experiment 


For the sequential rule R;’ with k = 2 we can now write down approxi- 
mations as well as upper and lower bounds to the expected duration 
E(T) of the experiment. From (50) 

E(F; 6) a 


+ [E@; 8) — E(T nn.) 


g + 


where c is the largest integer less than or equal to H(F; 6). The right ex- 
pression of (59) can be approximated by (53) and (54) if g is large. If 
c < 2n then the upper bounds are given by (57) and (58). A simpler 
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upper bound, which holds for all values of c is given by 


BP) $ HUF; ET ne) = BCP; 8) (9 + &) (60) 


CASE 3: COMMON UNKNOWN LOCATION PARAMETER g = 0 


In this case the more conservative procedure is to proceed under the 
assumption that g = 0. By the discussion above the probability require- 
ment will in most problems be satisfied for all g 2 0. The OC and ASN 
functions, which are now functions of the true value of g, were already 
obtained above. Of course, we need not consider values of g greater than 
the smallest observed lifetime of all units tested to failure. 


ADDENDUM 2 


For completeness it would be appropriate to state explicitly some of 
the formulas used in computing the tables in the early part of the paper. 
For the nonsequential, nonreplacement rule R; with k = 2 the proba- 
bility of a correct selection is 


Pla) = ff Sulu, ofela, 0) dy de (61) 
where 


file, 8) = = ORL — ey emer (r <n) (62) 


and C? is the usual combinatorial symbol. This can also be expressed in 
the form 


n\2 . (—1)"" 
P(a; Ri) = 1 — (rC Ss 
(a; Ri) 1 (r ees (63) 
{Brn — r+ 1 + a(n —r 47 


where B[x, y] is the complete Beta function. Equation (66) holds for 
any g = 0. 

For the rule R; the expected duration of the experiment for k = 2 
is given by 


wr) = f° xtfale, aL — F(x, 69] + fale, ODL — Fe, @)]} de (64) 


where f,(x, 8) is the density in (62) and F,(x, 6) is its c.d.f. This can 
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also be expressed in the form 


BeOS ye eee Oa. 165) 
. iaija(n—r+7lt+n—rt+ajtn-—r)p 


plus another similar expression in which 6;, @ are replaced by #2, a 
respectively. For g > 0 we need only add g to this result. This result 
was used to compute (7) in table 1A fora = landa = 2. Fora = © 
the expression simplifies to 


which can be shown to be equivalent to 


R(T) = & = —— 


jun—-jtil (67) 
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A Class of Binary Signaling Alphabets 


By DAVID SLEPIAN 
(Manuscript received September 27, 1955) 


A class of binary signaling alphabets called ‘‘group alphabets” is de- 
scribed. The alphabets are generalizations of Hamming’s error correcting 
codes and possess the following special features: (1) all letters are treated 
alike in transmission; (2) the encoding is simple to instrument; (3) maxi- 
mum likelihood detection is relatively simple to instrument; and (4) in 
certain practical cases there exist no better alphabets. A compilation is given 
of group alphabets of length equal to or less than 10 binary digits. 


INTRODUCTION 


This paper is concerned with a class of signaling alphabets, called 
“group alphabets,” for use on the symmetric binary channel. The class 
in question is sufficiently broad to include the error correcting codes of 
Hamming,’ the Reed-Muller codes,’ and all “systematic codes’”.’ On 
the other hand, because they constitute a rather small subclass of the 
class of all binary alphabets, group alphabets possess many important 
special features of practical interest. 

In particular, (1) all letters of the alphabets are treated alike under 
transmission; (2) the encoding scheme is particularly simple to instru- 
ment; (3) the decoder — a maximum likelihood detector — is the best 
possible theoretically and is relatively easy to instrument; and (4) in 
certain cases of practical interest the alphabets are the best possible 
theoretically. 

It has very recently been proved by Peter Elias‘ that there exist group 
alphabets which signal at a rate arbitarily close to the capacity, C, of 
the symmetric binary channel with an arbitrarily small probability of 
error. Elias’ demonstration is an existence proof in that it does not 
show explicitly how to construct a group alphabet signaling at a rate 
greater than C — e with a probability of error less than 6 for arbitrary 
positive 6 and e. Unfortunately, in this respect and in many others, our 
understanding of group alphabets is still fragmentary. 

In Part I, group alphabets are defined along with some related con- 
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cepts necessary for their understanding. The main results obtained up 
to the present time are stated without proof. Examples of these concepts 
are given and a compilation of the best group alphabets of small size 
is presented and explained. This section is intended for the casual reader. 

In Part II, proofs of the statements of Part I are given along with 
such theory as is needed for these proofs. 

The reader is assumed to be familiar with the paper of Hamming,’ 
the basic papers of Shannon’ and the most elementary notions of the 
theory of finite groups.° 


Part I — Group ALPHABETS AND THEIR PROPERTIES 
1.1 INTRODUCTION 


We shall be concerned in all that follows with communication over the 
symmetric binary channel shown on Fig. 1. The channel can accept 
either of the two symbols 0 or1. A transmitted 0 is received as a 0 with 
probability q and is received as a 1 with probability p = 1 — q: a trans- 
mitted 1 is received as a 1 with probability g and is received as a 0 with 
probability p. We assume 0 S p S 14. The “noise” on the channel 
operates independently on each symbol presented for transmission. The 
capacity of this channel is 


C=14 p logep + q logeg bits/symbol (1) 


By a K-letter, n-place binary signaling alphabet we shall mean a collec- 
tion of K distinct sequences of n binary digits. An individual sequence 
of the collection will be referred to as a letter of the alphabet. The integer 
K is called the size of the alphabet. A letter is transmitted over the 
channel by presenting in order to the channel input the sequence of n 
zeros and ones that comprise the letter. A detection scheme or detector for 


q 


INPUT OUTPUT 


q 


Fig. 1 — The symmetric binary channel. 
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a given K-letter, n-place alphabet is a procedure for producing a sequence 
of letters of the alphabet from the channel output. 

Throughout this paper we shall assume that signaling is accomplished 
with a given K-letter, n-place alphabet by choosing the letters of the 
alphabet for transmission independently with equal probability 1/K. 

Shannon’ has shown that for sufficiently large n, there exist K-letter, 
n-place alphabets and detection schemes that signal over the symmetric 
binary channel at a rate R > C — eé for arbitrary ¢ > 0 and such that 
the probability of error in the letters of the detector output is less than 
any 6 > 0. Here C is given by (1) and is shown as a function of p in 
Fig. 2. No algorithm is known (other than exhaustvie procedures) for 
the construction of K-letter, n-place alphabets satisfying the above 
inequalities for arbitrary positive 6 and ¢ except in the trivial cases C = 0 


and C = 1. 


1.2 THE GRouP B, 


There are a totality of 2” different n-place binary sequences. It is fre- 
quently convenient to consider these sequences as the vertices of a cube 
of unit edge in a Euclidean space of n-dimensions. For example the 5- 
place sequence 0, 1, 0, 0, 1 is associated with the point in 5-space whose 








Fig. 2— The capacity of the symmetric binary channel. 
= 1+ p loge p + (1 — p) log: (1 — p) 
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coordinates are (0, 1, 0, 0, 1). For convenience of notation we shall gen- 
erally omit commas in writing a sequence. The above 5-place sequence 
will be written, for example, 01001. 


We define the product of two n-place binury sequences, aid, +++ dy and 
bibe +++ bn as the n-place binary sequence 
Qtr, d+ be,-++,a.40, 


Here the a’s and b’s are zero or one and the + sign means addition 
modulo 2. (ThatisO +0= 1441 = 0, 0.26 Bh S01 se eS 1) 
For example, (01101) (00111) = 01010. With this rule of multiplication 
the 2” n-place binary sequences form an Abelian group of order 2”. 


The elements of the group, denoted by 7), T2,°-: , Tm, say, are the 
n-place binary sequences; the identity element I is the sequence 000 --- 0 
and 


IT; =TI=T7;; TT; = 7j;T:; TATiTs) = (PATH; ; 


the product of any number of elements is again an element; every ele- 
ment is its own reciprocal, 7; = T;, T; = I. We denote this group 
by B,. 

All subgroups of B, are of order 2° where & is an integer from the set 
0,1, 2,--:,n. There are exactly 


(2" = 22” as Pia Og = 2°) oo (2” aa oe) 
(2* <- 2°) (2 =. 21)(2* = 2") eee (2* —_ QF 1) (2) 
= N(n,n — k) 


N(n, k) = 
distinct subgroups of B, of order 2". Some values of N(n, k) are given in 


Table I. 


Tasie I — Somer Vauuses or N(n, k), tHE NuMBER OF SUBGROUPS 
or B, oF Orper 2”. N(n, k) = N(n, n — k) 


2 1 

3 1 7 7 1 

4 1 15 35 15 1 

5 1 31 155 155 31 1 

6 1 63 651 1395 651 63 

7 1 127 2667 11811 11811 2667 

8 1 255 10795 97155 200787 97155 

9 1 511 43435 788035 3309747 3309747 
10 1 1023 174251 6347715 53743987 109221651 
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1.3 GROUP ALPHABETS 


An n-place group alphabet is a K-letter, n-place binary signaling alpha- 
bet whose letters form a subgroup of B, . Of necessity the size of an 
n-place group alphabet is K = 2* where k is an integer satisfying 0 < 
k Sn. By an (n, k)-alphabet we shall mean an n-place group alphabet of 
size 2°. Example: the N(3, 2) = 7 distinct (3, 2)-alphabets are given by 
the seven columns 

(i) (ii) (iii) (iv) (v) (vi) (vii) 
000 000 000 000 000 000 000 
100 100 100 010 010 001 110 


010 001 O11 001 101 110 O11 (3) 
110 101 111 O11 111 111 101 


1.4 STANDARD ARRAYS 


Let the letters of a specific Ae k)-alphabet be Ay = J = 00--- 0, 
Az, As, °-:,A,, where p = 2”. The group B,, can be developed second: 
ing to this Sibetoup and its cosets: 
I, Ay ’ A3 ’ a) A, 
S2, SoAp , SoAg,+++ , SeAg 
S3 } S3Ae2 ; S343 poten ly S3A py 


Drs S,Aoz, S,As,-°°°, SAp 
p= 2", ne aa 


In this array every element of B, appears once and only once. The col- 
lection of elements in any row of this array is called a coset of the (n, k)- 
alphabet. Here S2 is any element of B, not in the first row of the array, 
S3 is any element of B, not in the first two rows of the array, etc. The 
elements S., S3;,---, S, appearing under J in such an array will be 
called the coset leaders. 

If a coset leader is replaced by any element in the coset, the same coset 
will result. That is to say the two collections of elements 


Ss ’ SjA2 ’ S;S3 has S:A, 
and 
SiAx , (S;Ap) Ao, (S;Az)A3,-++ (S:Ax)A, 


are the same. 
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We define the weight w; = w(T;) of an element, 7; , of B, to be the 
number of ones in the n-place binary sequence 7’; . 

Henceforth, unless otherwise stated, we agree in dealing with an ar- 
ray such as (4) to adopt the following convention: 


the leader of each coset shall be taken to be an 
element of minimal weight in that coset. 


(5) 


Such a table will be called a standard array. 
Example: Bz can be developed according to the (4, 2)-alphabet 0000, 
1100, 0011, 1111 as follows 


0000 1100 0011 1111 
1010 0110 1001 0101 6 
1110 0010 1101 0001 (6) 
1000 0100 1011 O111 


According to (5), however, we should write, for example 


0000 1100 +0011 
1010 0110 1001 0101 : 
0010 1110 0001 1101 (7) 
1000 0100 1011 111 


The coset leader of the second coset of (6) can be taken as any element 
of that row since all are of weight 2. The leader of the third coset, how- 
ever, should be either 0010 or 0001 since these are of weight one. The 
leader of the fourth coset should be either 1000 or 0100. 


1.5 THE DETECTION SCHEME 


Consider now communicating with an (n, k)-alphabet over the sym- 
metric binary channel. When any letter, say A;, of the alphabet is 
transmitted, the received sequence can be of any element of B,. We 
agree to use the following detector: 


af the received element of B, lies in column i of the array (4), the 
detector prints the letter A; ,7 = 1,2, +--+ , u. The array (4) ts to (8) 
be constructed according to the convention (8). 


The following propositions and theorems can be proved concerning 
signaling with an (n, k)-alphabet and the detection scheme given by (8). 


1.6 BEST DETECTOR AND SYMMETRIC SIGNALING 


- Define the probability £; = ¢(7';) of an element 7’; of B, to be €; = 
pg” °* where p and q are as in (1) and w; is the weight of 7;. Let 
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Q;,7 = 1, 2,--+, m be the sum of the probabilities of the elements in 
the 7th column of the standard array (4). . 
Proposition 1. The probability that any transmitted letter of the 
(n, k)-alphabet be produced correctly by the detector is Q: . 
Proposition 2. The equivocation® per symbol is 


1 B 
? H,(x) = —> 2, Qi logs Q: 


Theorem 1. The detector (8) is a maximum likelihood detector. That 
is, for the given alphabet no other detection scheme has a greater average 
probability that a transmitted letter be produced correctly by the de- 
tector. 

Let us return to the geometrical picture of n-place binary sequences 
as vertices of a unit cube in n-space. The choice of a K-letter, n-place 
alphabet corresponds to designating K particular vertices as letters. 
Since the binary sequence corresponding to any vertex can be produced 
by the channel output, any detector must consist of a set of rules that 
associates various vertices of the cube with the vertices designated as 
letters of the alphabet. We assume that every vertex is associated with 
some letter. The vertices of the cube are divided then into disjoint sets, 
Wi, We,-+-:, Wer where W; is the set of vertices associated with 7th 
letter of the signaling alphabet. A maximum likelihood detector is char- 
acterized by the fact that every vertex in W; is as close to or closer to 
the 7th letter than to any other letter, 7 = 1, 2, --- , K. For group alpha- 
bets and the detector (8), this means that no element in the 7th column 
of array (4) is closer to any other A than it is to A;,7 = 1, 2,--- ,y. 

Theorem 2. Associated with each (n, k)-alphabet considered as a point 
configuration in Euclidean n-space, there is a group of n X n orthogonal 
matrices which is transitive on the letters of the alphabet and which 
leaves the unit cube invariant. The maximum likelihood sets Wi , 
We,-:: W, are all geometrically similar. 

Stated in loose terms, this theorem asserts that in an (n, k)-alphabet 
every letter is treated the same. Every two letters have the same number 
of nearest neighbors associated with them, the same number of next 
nearest neighbors, etc. The disposition of points in any two W regions 
is the same. 


1.7 GROUP ALPHABETS AND PARITY CHECKS 


Theorem 3. Every group alphabet is a systematic’ code: every syste- 
matic code is a group alphabet.’ 
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We prefer to use the word “alphabet” in place of “code” since the 
latter has many meanings. In a systematic alphabet, the places in any 
letter can be divided into two classes: the information places — k in 
number for an (n, k)-alphabet — and the check positions. All letters 
have the same information places and the same check places. If there 
are k information places, these may be occupied by any of the 2" k-place 
binary sequences. The entries in the n — k check positions are fixed‘ 
linear (mod 2) combinations of the entries in the information positions. 
The rules by which the entries in the check places are determined are 
called parity checks. Examples: for the (4, 2)-alphabet of (6), namely 
0000, 1100, 0011, 1111, positions 2 and 3 can be regarded as the informa- 
tion positions. If a letter of the alphabet is the sequence aja2a3a,4 , then 
di = G2, G4 = 3 are the parity checks determining the check places 1 
and 4. For the (5, 3)-alphabet 00000, 10001, 01011, 00111, 11010, 10110, 
01100, 11101 places 1, 2, and 3 (numbered from the left) can be taken 
as the information places. If a general letter of the alphabet is a,a.aza,a5 , 
then a, = a2 + a3, a = a1 + a2 + a3. 

Two group alphabets are called equivalent if one can be obtained from 
the other by a permutation of places. Example: the 7 distinct (8, 2)- 
alphabets given in (3) separate into three equivalence classes. Alpha- 
bets (i), (ii), and (iv) are equivalent; alphabets (iii), (v), (vi), are equiva- 
lent; (vii) is in a class by itself. 

Proposition 8. Equivalent (n, k)-alphabets have the same probability 
Q: of correct transmission for each letter. 

Proposition 4. Every (n, k)-alphabet is equivalent to an (n, k)- 
alphabet whose first & places are information places and whose last n — k 
places are determined by parity checks over the first k places. 

Henceforth we shall be concerned only with (n, k)-alphabets whose 
first k places are information places. The parity check rules can the 
be written 


k 
a= Dygj, t= k+1,--+,0 (9) 
j=1 


where the sums are of course mod 2. Here, as before, a typical letter of 
the alphabet is the sequence aia2 +--+ a, . The y:; are k(n — k) quantities, 
zero or one, that serve to define the particular (n, k)-alphabet in question. 


1.8 MAXIMUM LIKELIHOOD DETECTION BY PARITY CHECKS 


For any element, 7’, of B, we can form the sum given on the right of 
(9). This sum may or may not agree with the symbol in the ith place of 
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T. If it does, we say T satisfies the zth-place parity check; otherwise T — 
fails the zth-place parity check. When a set of parity check rules (9) is 
given, we can associate an (n — k)-place binary sequence, R(7’), with 
each element 7 of B,. We examine each check place of T in order starting 
with the (k + 1)-st place of T. We write a zero if a place of 7' satisfies 
the parity check; we write a one if a place fails the parity check. The re- 
sultant sequence of zeros and ones, written from left to right is R(T). 
We call R(T) the parity check sequence of T. Example: with the parity 
rules ag = a2 + a3, d5 = a1 + a2 + as used to define the (5, 3)-alphabet 
in the examples of Theorem 3, we find R(11000) = 10 since the sum of 
the entries in the second and third places of 11001 is not the entry of 
the fourth place and since the sum of a1 = 1, a = 1, and a; = 0 is 
0 = asp. ‘ 

Theorem 4. Let I, Ao, +--+: A, be an (n, k)-alphabet. Let R(T) be the 
parity check sequence of an element 7 of B, formed in accordance with 
the parity check rules of the (n, k)-alphabet. Then R(71) = R(T?) if 
and only if 7; and T> lie in the same row of array (4). The coset leaders 
can be ordered so that R(S;) is the binary symbol for the integer 7 — 1. 

As an example of Theorem 4 consider the (4, 2)-alphabet shown with 
its cosets below 


0000 1011 0101 1110 
0100 1111 0001 1010 
0010 1001 0111 1100 
1000 0011 1101 0110 


The parity check rules for this alphabet are a3 = a1, a1 = a1 + a. 
Every element of the second row of this array satisfies the parity check 
in the third place and fails the parity check in the 4th place. The parity 
check sequence for the second row is 01. The parity check for the third 
row is 10, and for the fourth row 11. Since every letter of the alphabet 
satisfies the parity checks, the parity check sequence for the first row is 
00. We therefore make the following association between parity check 
sequences and coset leaders 


00 — 0000 = 8S, 
01 — 0100 = & 
10 — 0010 = 8 
11 — 1000 = Sy 


1.9 INSTRUMENTING A GROUP ALPHABET 


Proposition 4 attests to the ease of the encoding operation involved 
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with the use of an (n, k)-alphabet. If the original message is presented as 
a long sequence of zeros and ones, the sequence is broken into blocks of 
length k places. Each block is used as the first k places of a letter of 
the signaling alphabet. The last n-k places of the letter are determined 
by fixed parity checks over the first k places. 

Theorem 4 demonstrates the relative ease of instrumenting the maxi- 
mum likelihood detector (8) for use with an (n, k)-alphabet. When an 
element 7 of B, is received at the channel output, it is subjected to the 
n-k parity checks of the alphabet being used. This results in a parity 
check sequence R(T). R(T) serves to identify a unique coset leader, say 
S;. The product S;7 is then formed and produced as the detector out- 
put. The probability that this be the correct letter of the alphabet is Q: . 


1.10 BEST GROUP ALPHABETS 


Two important questions regarding (n, k)-alphabets naturally arise. 
What is the maximum value of Q: possible for a given n and k and which 
of the N(n, k) different subgroups give rise to this maximum (,? The 
answers to these questions for general n and k are not known. For many 
‘special values of n and k the answers are known. They are presented in 
Tables II, III and IV, which are explained below. 

The probability Q, that a transmitted letter be produced correctly by 
the detector is the sum, Qi: = )_j £(S,) of the probabilities of the coset 
leaders. This sum can be rewritten as Qi = > imo a: p'g” * where a; is 
the number of coset leaders of weight 7. One has, of course, )oa; = v = 


2” for an (n, k)-alphabet. Also a; S eo - ~! since this is the 


= ~ al(n — 2) 
number of elements of B,, of weight 7. 

The a; have a special physical significance. Due to the noise on the 
channel, a transmitted letter, A; , of an (n, k)-alphabet will in general be 
received at the channel output as some element 7 of B, different from 
A;.If T differs from A; in s places, i.e., if w(A;T) = s, we say that an 
- s-tuple error has occurred. For a given (n, k)-alphabet, a; is the number 
of 7-tuple errors which can be corrected by the alphabet in question, 


10D ares ay, 
Table IT gives the a; corresponding to the largest possible value of Qi 
for a given k and n fork = 2,3, -:-n—1,n =4--- , 10 along witha 


few other scattered values of n and k. For reference the binomial coeffi- 
cients & are also listed. For example, we find from Table II that the 


best group alphabet with 2* = 16 letters that uses n = 10 places has a 
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probability of correct transmission Q: = g”° + 10q°p + 39q°p" + 14q'p*. 
The alphabet corrects all 10 possible single errors. It corrects 39 of the 
possible () = 45 double errors (second column of Table II) and in 
addition corrects 14 of the 120 possible triple errors. By adding an addi- 
tional place to the alphabet one obtains with the best (11, 4)-alphabet 
an alphabet with 16 letters that corrects all 11 possible single errors and 
all 55 possible double errors as well as 61 triple errors. Such an alphabet 
might be useful in a computer representing decimal numbers in binary 
form. 

For each set of a’s listed in Table II, there is in Table III a set of 
parity check rules which determines an (n, k)-alphabet having the given 
a’s. The notation used in Table III is best explained by an example. A 
(10, 4)-alphabet which realizes the a’s discussed in the preceding para- 
graph can be obtained as follows. Places 1, 2, 3, 4 carry the information. 
Place 5 is determined to make the mod 2 sum of the entries in places 
3, 4, and 5 equal to zero. Place 6 is determined by a similar parity check 
on places 1, 2, 3, and 6; place 7 by a check on places 1, 2, 4, and 7, ete. 

It is a surprising fact that for all cases investigated thus far an (n, k)- 
alphabet best for a given value of p is uniformly best for all values of 
p,0 S$ p S 14. It is of course conjectured that this is true for all n and k. 

It is a further (perhaps) surprising fact that the best (n, &)-alphabets 
are not necessarily those with greatest nearest neighbor distance be- 
tween letters when the alphabets are regarded as point configurations on 
the n-cube. For example, in the best (7, 3)-alphabet as listed in Table 
III, each letter has two nearest neighbors distant 3 edges away. On the 
other hand, in the (7, 3)-alphabet given by the parity check rules 418, 
512, 623, 7123 each letter has its nearest neighbors 4 edges away. This 
latter alphabet does not have as large a value of Q,, however, as does 
the (7, 3)-alphabet listed on Table III. 

The cases k = 0, 1, n — 1, n have not been listed in Tables II and ITI. 
The cases k = 0 and k = n are completely trivial. For k = 1, alln > 1 
the best alphabet is obtained using the parity rule a2 = ag = +--+ = 
Gd, = lin = 2), 

SS WY chee AIRY 33 2. (n\ 4 oni 
Qi = = (7) oa “45 ) py. lin = 2+1,Q= 2 (") py. 
o \t 24 0 v) 
Fork = n — 1,n > 1, the maximum Q, is Q, = g” and a parity rule 
for an alphabet realizing this Qi is a, = a1. 


If the a’s of an (n, k)-alphabet are of the form a; = a t= 0,515 
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‘Taste II — Prosasiuiry or No Error witn Best 
ALPHABETS, Q: = >) apg” ’ 
ny |k=2;k=3|k=4|kh=5])k=6|k=7|k=8|k=9|k=10 
i 7) ay a; a; ay ai aj ay aj a; 
n=4 0 1 1 
1 4 3 
0 1 1 1 
n= 5 1 5 5 3 
2), 40 2 
0 1 1 1 1 
n= 6 1 6 6 6 3 
2 15 9 1 
0 1 1 1 1 1 
n=7 1 7 7 7 7 3 
al ae 18 8 
3 25 6 
0 1 1 1 1 1 1 
n=8 1 8 8 8 8 7 3 
2 28 28 20 7 
3 56 27 3 
0 1 1 1 1 1 1 1 
1 9} 9 9 9 9 7 3 
n=9 2 36 36 33 22 6 
3 84 64 21 
4 ) 126 18 
0 1 1 1 1 1 1 1 1 
1 10 10 10 10 10 10 7 3 
n = 10 2 45.) 45 45 39 21 5 
3 | 120 | 110 64 14 
4] 210} 90! 8 
0 1 i 1 1 1 1 1 1 
1 11 11 11 11 1] 11 7 3 
n= 11 2 5d 55 55 55 ‘20 4 
3 | 165 | 165 | 126 61 
4 | 330 | 226 63 
5 | 462 | - 54 
0 1 1 1 1 1 1 1 
1 12-3) 12% = 12 12 | 7 3 
n= 12 2 66 66 66 19 3 
3 | 220 | 220 | 200 
4 | 495 | 425 | 233 
5 | 792 | 300 
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2,°°* 595, @j41 = 7 some integer, ajyo = ajas = *°* = On = O, then 
there does not exist a 2"-letter, n-place alphabet of any sort better than 
the given (n, k)-alphabet. It will be observed that many of the a’s of 
Table II are of this form. It can be shown that 
Proposition 6 if n + & 5 *) + « i ®) > 2” — | there exists 
no 2*-letter, n-place alphabet better than the best (n, &)-alphabet. 
When the inequality of proposition 5 holds the a’s are either ay = 1, 


a = 2”* — 1, all other a = 0; ora = 1, a = peor? -1- 


n 
1 
(") all other a = 0; or the trivial ag = 1 all other a = O which holds 
when k = n. The region of the n — k plane for which it is known that 
(n, k)-alphabets cannot be excelled by any other is shown in Table IV. 


1.11 A DETAILED EXAMPLE 


As an example of the use of (n, k)-alphabets consider the not un- 
realistic case of a channel with p = 0.001, i.e., on the average one binary 
digit per thousand is received incorrectly. Suppose we wish to transmit 
messages using 32 different letters. If we encode the letters into the 32 
5-place binary sequences and transmit these sequences without further 
encoding, the probability that a received letter be in error is 1 — 
(1 — p) = 0.00449. If the best (10, 5)-alphabet as shown in Tables II 
and III is used, the probability that a letter be wrong is 1 — Qi: = 
1 — ¢° — 10q’p — 21g'p? = 24p° — 72p° + --- = 0.000024. Thus 
by reducing the signaling rate by 14, a more than one hundredfold re- 
duction in probability of error is accomplished. : 

A (10, 5)-alphabet to achieve these results is given in Table III. Let 
a typical letter of the alphabet be the 10-place sequence of binary digits 
M02 +++ Adio. The symbols aya,a3a,a3 carry the information and can be 
any of 32 different arrangements of zeros and ones. The remaining places 
are determined by 


de = 1 + a3 + as + as 
a@=atata+as : 
dg = a1 + a2 + a3 + as 
=a +a+tast a 

dio = G1 + a2 + a3 + ay + as 


To design the detector for this alphabet, it is first necessary to deter- 
mine the coset leaders for a standard array (4) formed for this alphabet. 


Tasue III — Pariry Cuecx RvuuLes ror Best ALPHABETS 


k = 10 
































ao 
io oon) 
Ray 
ll HH oD 
RS CONN 
| moe 
I- © ~oOoc 
In ut > Ld = 
me CONN CONNN 
aH aod aad 
Ol Ol- Oman co 
=H 
= cm =H “HH oo aH <tH 00 09 aH sto 09 09 
q aN NN ONAN OANA 
Pi ee i el aaa 
wdc anion i ud <> b= OO uD Ob CO 
nD OD OD ine) 
ll NOD N OD OD OCONNN NOD COON NOON 
“£2 rtrd mre N Sono oon ool mr IN mNnRNe 
tH 1d aH iM cD Haignw i Tag Oo i~ CD Hid cory Der 
a ANN i) N AN 
i NA ANG AAAs ARB RN FANN RBA YH 
me OO =H CO shld OD Hid cD OO =H ld <P oo Hd cor CD OO SHAd cor COG 
tt us © I~ DH oC 
ll i ll ll Il II 
Ss = [4 = = = 











91 
101 


NN 
mM NANA SS 
OO SHAD CO CO Co rt 


N 
-e 
So 


i= 
fn! 


Il 
= 


tH 
<H on 

Too 

CO HH OD OD OI 
FAANAS, 


WOm~OOa eS 


On OD 


OOAN 
NANO a as 


ANOS alg 
Hin Or OD errs 


AN NAN 


rtrd er 
On WAAANNN oA 


OD Han OM OO reir COHN Ol DH Oar 


+ 
a 
ll 
S 


i] 


n 


217 


218 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 


TasBLe [V — REGION oF THE n-k PLANE FOR WHICH IT IS KNOWN 
THAT (n, k)-ALPHABETS CANNoT Br EXCELLED 





oe ec e e@ 
oe e ee @ 
oe e¢ e@ e@ @ 


oe ee e @ @ 
oc eeie 
oe @ @ @ 
eee e © © © © © © © © © © © © © © © © © © ee ee he ee 


oe eee © © © © © © @ © © © © © © © © © © © © © © © © © © © 


012345678910 12 14 16 18 20 22 24 26 28 30n 


This can be done by a variety of special methods which considerably 
reduce the obvious labor of making such an array. A set of best S’s along 
with their parity check symbols is given in Table V. 

A maximum likelihood detector for the (10, 5)-alphabet in question 
forms from each received sequence bbe --+ bio the parity check symbol 
€1C2C3C4Cs Where 


[a 


a=b +h +b; + b4 + bs 
Co = by + by + be + bs + Ds 
cs = bg -- bi + be + bs + Ds 
ca = do + bi + be + bs + by 
C5 Di0 ae by ste De te bs oh by =f bs 
According to Table V, if cicecscacs contains less than three ones, the de- 


tector should brint bibebsb4bs . The detector should print (6: + 1)bebsbabs 
if the parity check sequence cy¢s¢3¢4cs is either 11111 or 11110; the de- 
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TaBLE V — Coset LEADERS AND Parity CHECK SEQUENCES 
For (10,5)-ALPHABET 








C1C2C3C4C5 oO S C1C20304¢5 —- S 

00000 0000000000 11100 0000100001 
10000 0000010000 11010 0001000001 
01000 0000001000 11001 0001000010 
00100 0000000100 10110 0010000001 
00010 0000000010 10101 0010000010 
00001 0000000001 10011 0010000100 
11000 0000011000 01110 0100000001 
10100 0000010100 01101 0100000010 
10010 0000010010 01011 0100000100 
10001 0000010001 00111 0100001000 
01100 0000001100 11110 1000000001 
01010 0000001010 11101 0000100000 
01001 0000001001 11011 0001000000 
00110 0000000110 10111 0010000000 
00101 0000000101: 01111 0100000000 
00011 0000000011 11111 1000000000 


tector should print b1(b2 + 1)bsbybs if the parity check sequence is 01111, 
~ 00111, 01011, 01101, or 01110; the detector should print bibo(bs + 1)babs 
if the parity check sequence is 10111, 10011, 10101, or 10110; the de- 
tector should print bibeb3(b4 + 1)bs if the parity check sequence is 11011, 
11001, 11010; and finally the detector should print bibsbzbs(bs + 1) if the 
parity check sequence is 11101 or 11100. 

Simpler rules of operation for the detector may possibly be obtained 
by choice of a different set of S’s in Table V. These quantities in general 
are not unique. Also there may exist non-equivalent alphabets with 
simpler detector rules that achieve the same probability of error as the 
alphabet in question. 


Part If — AppirionaL THEORY AND PROOFS OF THEOREMS OF Part I 
2.1 THE ABSTRACT GROUP C, 


It will be helpful here to say a few more words about B, , the group 
of n-place binary sequences under the operation of addition mod 2. This 
group is simply isomorphic with the abstract group C, generated by n 
commuting elements of order two, say ai, d2,°--: , @,. Here aa; = 
a,a; anda; = I,i,7 = 1, 2,--:, , where J is the identity for the 
group. The eight distinct elements of C3 are, for example, J, a1, a, 
Gz , Aig , A103 , AeG3 , Aid2a3. The group C’, is easily seen to be isomorphic 
with the n-fold direct product of the group Ci with itself. 
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It is a considerable saving in notation in dealing with C, to omit the 
symbol ‘‘a”’ and write only the subscripts. In this notation for example, 
the elements of C, are I, 1, 2, 3, 4, 12, 18, 14, 23, 24, 34, 123, 124, 134, 
234, 1234. The product of two or more elements of C’, can readily be 
written down. Its symbol consists of those numerals that occur an odd 
number of times in the collection of numerals that comprise the sym- 
bols of the factors. Thus, (12)(284)(123) = 24. 

The isomorphism between C, and B, can be established in many ways. 
The most convenient way, perhaps, is to associate with the element 
1ytel3 +++ 1, Of C, the element of B, that has ones in places 71, 12, +++ , t 
and zeros in the remaining n — k places. For example, one can associate 
124 of Cy with 1101 of B, ; 14 with 1001, etc. In fact, the numeral no- 
tation afforded by this isomorphism is a much neater notation for B, 
than is afforded by the awkward strings of zeros and ones. There are, 
of course, other ways in which elements of C,, can be paired with elements 
of B, so that group multiplication is preserved. The collection of all such 
“pairings” makes up the group of automorphisms of C,. This group of 
automorphisms of C’, is isomorphic with the group of non-singular linear 
homogenous transformations in a field of characteristic 2. 

An element T of C, is said to be dependent upon the set of elements 
T,, T2, ++: , T; of C, if T can be expressed as a product of some ele- 
ments of the set 71, T2, --- , T;; otherwise, 7’ is said to be independent 
of the set. A set of elements is said to be independent if no member can 
be expressed solely in terms of the other members of the set. For example, 
in Cs, 1, 2, 3, 4 form a set of independent elements as do likewise 2357, 
123857, 14. However, 185 depends upon 145, 3457, 57 since 185 = 
(145) (3457) (57). Clearly any set of m independent elements of C, can 
be taken as generators for the group. For example, all possible products 
formed of 12, 123, and 23 yield the elements of C3. 

Any k independent elements of C,, serve as generators for a subgroup 
of order 2°. The subgroup so generated is clearly isomorphic with C; . 
All subgroups of C,, of order 2° can be obtained in this way. 

The number of ways in which k independent elements can be chosen 
from the 2” elements of C,, is 


F(n, k) = (2” — 2°)(2" — 2')(2” — 2°) --- (2% — 2) 


For, the first element can be chosen in 2” — 1 ways (the identity cannot 
be included in a non-trivial set of independent elements) and the second 
element can be chosen in 2” — 2 ways. These two elements determine a 
subgroup of order 2”. The third element can be chosen as any element of 
the remaining 2” — 2” elements. The 3 elements chosen determine a 
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subgroup of order 2°. A fourth independent element can be chosen as 
any of the remaining 2” — 2° elements, etc. 

Each set of & independent elements serves to generate a subgroup of 
order 2". The quantity F(n, k) is not, however, the number of distinct 
subgroups of C, of this order, for, a given subgroup can be obtained 
from many different sets of generators. Indeed, the number of different 
sets of generators that can generate a given subgroup of order 2” of C, 
is just F'(k, k) since any such subgroup is isomorphic with C; . Therefore 
_ the number of subgroups of C,, of order 2" is N(n, k) = F(n, k)/F(k, k) 
which is (2). A simple calculation gives N(n, k) = N(n,n — k). 


2.2 PROOF OF PROPOSITIONS 1 AND 2 


After an element A of B, has been presented for transmission over 
a noisy binary channel, an element 7 of B, is produced at the channel 
output. The element U = AT of B, serves as a record of the noise 
during the transmission. U is an n-place binary sequence with a one at 
each place altered in A by the noise. The channel output, 7’, is obtained 
from the input A by multiplication by U: T = UA. For channels of the 
sort under consideration here, the probability that U be any particular 
element of B, of weight w is p°g” ”. 

Consider now signaling with a particular (n, k)-alphabet and consider 
the standard array (4) of the alphabet. If the detection scheme (8) is 
used, a transmitted letter A; will be produced without error if and only 
if the received symbol is of the form S;A;. That is, there will be no 
error only if the noise in the channel during the transmission of A; is 
represented by one of the coset leaders. (This applies forz = 1, 2, --- , 
p = 2"). The probability of this event is Q, (Proposition 1, Section 1.6). 
The convention (5) makes Q, as large as is possible for the given alpha- 
bet. 

Let X refer to transmitted letters and let Y refer to letters produced 
by the detector. We use a vertical bar to denote conditions when writing 
probabilities. The quantity to the right of the bar is the condition. We 
suppose the letters of the alphabet to be chosen independently with 
equal probability 27”. 

The equivocation h(X | Y) obtained when using an (n, k)-alphabet 
with the detector (8) can most easily be computed from the formula 


A(X | ¥) = hCX) — h(Y) + AY |X) (10) 


The entropy of thesourceish(X) = k/n bits per symbol. The probability 
that the detector produce A; when A; was sent is the probability that 
the noise be represented by A;:A;S., € = 1, 2, --- , ». In symbols, 
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Pr(¥Y — A;|X > A) = >> Pr(N > A,A;8) = Q(A:A,) 
rf 


where @Q(A,;) is the sum of the probabilities of the elements that are in 
the same column as A; in the standard array. Therefore 


PY > As) = D PAY 3 As |X > AYP > Ad = DQAAY 


| bes 


7 since D7 (AA) = 2) Q(Ad) = 1. 


bo 


This last follows from the group property of the alphabet. Therefore 
1 ae 
h(Y) = ae > Pr(¥ — A;) log Pr(¥Y > A;) = 7 bits/symbol. 
Tt follows then from (10) that 
h(X | Y) = h(Y | X) 


The computation of h(Y | X) follows readily from its definition 
n(Y |X) = 2) Pr(X 3 AdAY | X > Ai) 


ll 


—>> Pr(X > A) Pr(¥ > A; |X > Ad) 
ij 


log Pr(Y > A; | X — Aj) 


I 


-% Ds > Pr(N — AjS8A;) log >> Pr(N — AiSnA;) 
tj m 


-% »» Q(A:A;) log Q(A:A)) 


> Q(A;) log Q(A;) 


Each letter is m binary places. Proposition 2, then follows. 


2.38 DISTANCE AND THE PROOF OF THEOREM 1 


Let A and B be two elements of B, . We define the distance, d(A, B), 
between A and B to be the weight of their product, 


- d(A, B) = w(AB) a) 


The distance between A and B is the number of places in which A and 
B differ and is just the “Hamming distance.” * In terms of the n-cube, 
d(A, B) is the minimum number of edges that must be traversed to go 
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from vertex A to vertex B. The distance so defined is a monotone fune- 
tion of the Euclidean distance between vertices. 
It follows from (11) that if C is any element of B,, then 


d(A, B) = d(AC, BC) (12) 


This fact shows the detection scheme (8) to be a maximum likelihood 
detector. By definition of a standard array, one has 


d(S;, J) S d(S,A;, I) for all 7 and j 
The coset leaders were chosen to make this true. From (12), 
d(8S:,D = d(S:AnS;, I AnS:) = d(SiAm, Am) 
d(S;A;, 2) = d(S.AjSiAn, I SiAm) = d(AjAn, SiAm) 
d(SiAm , Az) 


where A, = A;A,,. Substituting these expressions in the inequality 
above yields 


d(SiAm, Am) S d(SiAn, Aa) for all 2, m, ¢ 


This equation says that an arbitrary element in the array (4) is at least 
as close to the element at the top of its column as it is to any other letter 
of the alphabet. This is the maximum likelihood property. 


2.4 PROOF OF THEOREM 2 


Again consider an (n, k)-alphabet as a set of vertices of the unit n-cube. 

Consider also n mutually perpendicular hyperplanes through the cen- 
troid of the cube parallel to the coordinate planes. We call these planes 
“symmetry planes of the cube” and suppose the planes numbered in 
accordance with the corresponding parallel coordinate planes. 
_ The reflection of the vertex with coordinates (a1, @2, --* ,@i, °** , Qn) 
in symmetry plane 7 yields the vertex of the cube whose coordinates 
are (a1, @2, --: , @; + 1,---, an). More generally, reflecting a given 
vertex successively in symmetry planes 2, 7, k, --- yields a new vertex 
whose coordinates differ from the original vertex precisely in places 
1,7, k «++ . Successive reflections in hyperplanes constitute a transfor- 
mation that leaves distances between points unaltered and is therefore 
a “rotation.’? The rotation obtained by reflecting successively in sym- 
metry planes 2, 7, k, etc. can be represented by an n-place symbol having 
a one in places 2, 7, k, etc. and a zero elsewhere. 

We now regard a given (n, k)-alphabet as generated by operating on 
the vertex (0, 0, --- , 0) of the cube with a certain collection of 2” ro- 
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tation operators. The symbols for these operators are identical with the 
sequences of zeros and ones that form the coordinates of the 2” points. 
It is readily seen that these rotation operators form a group which is 
transitive on the letters of the alphabet and which leave the unit cube 
invariant. Theorem 2 then follows. 

Theorem 2 also follows readily from consideration of the array (4). 
For example, the maximum likelihood region associated with J is the 
set of points J, S2, S3, --- , S,. The maximum likelihood region asso- 
ciated with A; is the set of points A;, AiS2, AiS3, --- , A:S,. The 
rotation (successive reflections in symmetry planes of the cube) whose 
symbol is the same as the coordinate sequence of A; sends the maximum 
likelihood region of J into the maximum likelihood region of A;, 7 = 
12,°++, pm 


2.5 PROOF OF THEOREM 3 


That every systematic alphabet is a group alphabet follows trivially 
from the fact that the sum mod 2 of two letters satisfying parity checks 
is again a letter satisfying the parity checks. The totality of letters satis- 
fying given parity checks thus constitutes a finite group. 

To prove that every group alphabet is a systematic code, consider 
the letters of a given (n, k)-alphabet listed in a column. One obtains in 
this way a matrix with 2" rows and n columns whose entries are zeros 
and ones. Because the rows are distinct and form a group isomorphic to 
C;,, there are k linearly independent rows (mod 2) and no set of more 
than k independent rows. The rank of the matrix is therefore k. The 
matrix therefore possesses k linearly independent (mod 2) columns and 
the remaining n — k columns are linear combinations of these k. Main- 
taining only these k linearly independent columns, we obtain a matrix of 
k columns and 2" rows with rank k. This matrix must, therefore, have & 
linearly independent rows. The rows, however, form a group under mod 
2 addition and hence, since & are linearly independent, all 2° rows must 
be distinct. The matrix contains only zeros and ones as entries; it has 2” 
distinct rows of k entries each. The matrix must be a listing of the num- 
bers from 0 to 2° — 1 in binary notation. The other n — k columns of 
the original matrix considered are linear combinations of the columns of 
this matrix. This completes the proof of Theorem 3 and Proposition 4. 


2.6 PROOF OF THEOREM 4 


To prove Theorem 4 we first note that the parity check sequence of 
the product of two elements of B, is the mod 2 sum of their separate 
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parity check sequences. It follows then that all elements in a given coset 
have the same parity check sequence. For, let the coset be S;, S:iAz, 


S:A3, --- S:A,. Since the elements J, A, , A3, --- , A, all have parity 
check sequence 00 --- 0, all elements of the coset have parity check 
R(S;). 


In the array (4) there are 2” cosets. We observe that there are 2” * 
elements of B, that have zeros in their first k places. These elements 
have parity check symbols identical with the last n — k places of their 
symbols. These elements therefore give rise to 2” different parity check 
symbols. The elements must be distributed one per coset. This proves 
Theorem 4. 


2.7 PROOF OF PROPOSITION 5 


n—k n—k n—k 
ae -( 2 )-( 3 )-a 


we can explicity exhibit group alphabets having the property mentioned 
in the paragraph preceding Proposition 5. The notation of the demon- 
stration is cumbersome, but the idea is relatively simple. 

We shall use the notation of paragraph 2.1 for elements of B, , ie., 
an element of B, will be given by a list of integers that specify what 
places of the sequence for the element contain ones. It will be convenient 
furthermore to designate the first k places of a sequence by the integers 
1, 2,3, >: , k and the remaining n — k places by the “integers” 1’, 2’, 
3’, --- , &’, where ¢ = n — k. For example, ifn = 8, k = 5, we have 

10111010 <— 13452’ 


10000100 <> 11’ 
00000101 <> 1’3’ 


If 


Consider the group generated by the elements 1’, 2’, 3’, --- , ¢’, ie. 
the 2° elements J, 1’, 2’, «++ , ¢’, 1/2’, 18’, --- , 1'2’3’ --- @’. Suppose 
these elements listed according to decreasing weight (say in decreasing 
order when regarded as numbers in the decimal system) and numbered 
consecutively. Let B; be the 7th element in the list. Example: if £ = 3, 
Bes 123) by = 29, Bs oe 85 Bes 12 Be = 35 Be =H 2 Be eS 1: 

Consider now the (n, k)-alphabet whose generators are 


1B, , 2Bo, 3B;, ye , KB, 
We assert that if 
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3 nk n—k ne n—-k 
Oa Gey ie ; 


this alphabet is as good as any other alphabet of 2” letters and n places. 
In the first place, we observe that every letter of this (n, k)-alphabet 
(except J) has unprimed numbers in its symbols. It follows that each of 
the 2° letters I, 1’, 2’, --- , €’, 1/2’, «++ , 12’ ++ € occurs in a different 
coset of the given (n, k)-alphabet. For, if two of these letters appeared 
in the same coset, their product (which contains only primed numbers) 
would have to be a letter of the (n, k) alphabet. This is impossible since 
every letter of the (n, k) alphabet has unprimed numbers in its symbol. 
Since there are precisely 2‘ cosets we can designate a coset by the single 
element of the list Bi, Bs, --- , Byt¢ = I which appears in the coset. 
We next observe that the condition 


axk n—k n—k 
meet (34)-(5)-1 


guarantees that By41 is of weight 3 or less. For, the given condition is 


equivalent to 
£ £ £ £ 
= ok = = 
#2 2 (9)-(¢)-(@)-(3) 


We treat several cases depending on the weight of Bri. 

If By, 41 is of weight 3, we note that forz = 1,2, --- , k, the coset con- 
taining B; also contains an element of weight one, namely the element 
7 obtained as the product of B; with the letter 7B; of the given (n, k)- 
alphabet. Of the remaining (2° — k) B’s, one is of weight zero, ¢ are of 

£ 
weight one, é 
have, then a = lar = (+ k = n. Now every B of weight 4 occurs in 
the list of generators 1B1, 2B.,--- , kB, . It follows that on multi- 
plying this list of generators by any B of weight 3, at least one element 
of weight two will result. (E.g., (1’2’3’)(j1’2/3’4’) = 74’) Thus every 
coset with a B of weight 2 or 3 contains an element of weight 2 and 
a = 2'— a — a. 

The argument in case B;,41 is of weight two or one is similar. 


) are of weight 2 and the remaining are of weight 3. We 


2.8 MODULAR REPRESENTATIONS OF C. 


In order to explain one of the methods used to obtain the best (n, k)- 
alphabets listed in Tables IJ and III, it is necessary to digress here to 
present additional theory. 
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It has been remarked that every (n, k)-alphabet is isomorphic with 
C;, . Let us suppose the elements of C; listed in a column starting with J 
and proceeding in order J, 1, 2, 3, --- , k, 12, 18, --- , (k — Ik, 123, 
tee eee , 128 --- k. The elements of a given (n, k)-alphabet can be 
paired off with these abstract elements so as to preserve group multipli- 
cation. This can be done in many different ways. The result is a matrix 
with elements zero and one with n columns and 2" rows, these latter 
being labelled by the symbols J, 1, 2, --- etc. What can be said about 
the columns of this matrix? How many different columns are possible 
when all (n, k)-alphabets and all methods of establishing isomorphism 
with C;, are considered? 

In a given column, once the entries in rows 1, 2, --- , k are known, the 
entire column is determined by the group property. There are therefore 
only 2” possible different columns for such a matrix. A table showing 
these 2" possible columns of zeros and ones will be called a modular repre- 
sentation table for C),. An example of such a table is shown for k = 4 in 
Table VI. 

It is clear that the columns of a modular representation table can also 
be labelled by the elements of C;, , and that group multiplication of these 
column labels is isomorphic with mod 2 addition of the columns. The 
table is a symmetric matrix. The element with row label A and column 
label B is one if the symbols A and B have an odd number of different 
numerals in common and is zero otherwise. 

Every (n, k)-alphabet can be made from a modular representation 
table by choosing n columns of the table (with possible repetitions) at 
least k of which form an independent set. 


TaBLeE VI — Moputar REPRESENTATION TABLE FOR Group C4 


I 1 2 3 4 12 13 14 23 24 34 123 124 134 234 1234 





I 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 

1 0 1 0 0 0 1 21 21 0 0 0 21 1 1 0 1 
2/0 0 1 0 0 1 0 0 1 1 0 +21 1 0 1 1 

oy Oo 0. OE oh ® 2. OOD OO OR Cd 0 1 1 1 
4/0 0 0 0 1 0 0 1 0 1 1 = =0 1 1 1 1 
12. 00 Ae =D oO. i oO Bee ok OF 0 0 1 1 0 
123°; 0 1 0 1 0 1 0 1 21 0 1 =0 1 0 1 0 
14,0 1 0 0 1 1 1 0 0 212 21 ~21 0 0 1 0 
23 ;0 0 1 1 0 1 1 0 0 1 421 ~=«0 1 1 0 0 
24/0 0 1 0 1 1 0 1 21 0 1 ~=+721 0 1 0 0 
34/0 0 0 1 1 0 1 21 1 2 0 21 i! 0 0 0 
1223/0 1 1 1 0 0 0 1 0 1 21 ~427 0 0 0 1 
124;0 1 1 0 1 0 1 0 1 0 21 = «0 1 0 0 1 
134 |}0 1 0 1 1 1 0 0 1 21 0 =0 0 1 0 1 
0 0 1 1 1 21 1 1 0 0 0 0 0 0 1 1 

0 1! 1 1 21 0 0 0 0 0 0 41 1 1 1 0 
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We henceforth exclude consideration of the column J of a modular 
representation table. Its inclusion in an (n, k)-alphabet is clearly a waste 
of 1 binary digit. 

It is easy to show that every column of a modular representation table 
for C;, contains exactly 2"”* ones. Since an (n, k)-alphabet is made from 
n such columns the alphabet contains a total of n2* ones and we have 

Proposition 6. The weights of an (n, k)-alphabet form a partition of 
n2* into 2" — 1 non-zero parts, each part being an integer from the set 
| eee | 
The identity element always has weight zero, of course. 

It is readily established that the product of two elements of even 
weight is again an element of even weight as is the product of two ele- 
ments of odd weight. The product of an element of even weight with an 
element of odd weight yields an element of odd weight. 

The elements of even weight of an (n, k)-alphabet form a subgroup 
and the preceding argument shows that this subgroup must be of order 
2* or 2°". If the group of even elements is of order 2°’, then the collec- 
tion of even elements is a possible (n, k — 1)-alphabet. This (n, k — 1) 
alphabet may, however, contain the column J of the modular represen- 
tation table of Cy_1. We therefore have 

Proposition 7. The partition of Proposition 6 must be either into 
2* — 1 even parts or else into 2°” odd parts and 2°" — 1 even parts. 
In the latter case, the even parts form a partition of a2” where a is 
some integer of the set k — 1, k, --- , n and each of the parts is an in- 
teger from the set 1, 2, --- , n. 


2.9 THE CHARACTERS OF C;, 


Let us replace the elements of B, (each of which is a sequence of zeros 
and ones) by sequences of +1’s and —1’s by means of the following 
substitution 


0Ono1 


1l<o -1. 


(13) 


The multiplicative properties of elements of B, can be preserved in this 
new notation if we define the product of two +1, —1 symbols to be the 
symbol whose 7th component is the ordinary product of the 7th compo- 
nents of the two factors. For example, 1011 and 0110 become respectively 
—11 —1 —1 andi —1 —11. We have 


Cat Get et aes 
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corresponding to the fact that 
(1011) (0110) = (1101) 


If the +1, —1 symbols are regarded as shorthand for diagonal matrices, 
so that for example 


-—li -1-l< 


oOo Coc Fe 
oor oO 
| 
oro © 
re OO & 


then group multiplication corresponds to matrix multiplication. 

(While much of what follows here can be established in an elementary 
way for the simple group at hand, it is convenient to fall back upon the 
established general theory of group representations’ for several proposi- 
tions. 

The substitution (18) converts a modular representation table (col- 
umn J included) into a square array of +1’s and —1’s. Each column (or 
row) of this array is clearly an irreducible representation of Ci, . Since C; 
is Abelian it has precisely 2° irreducible representations each of degree 
one. These are furnished by the converted modular table. This table also 
furnishes then the characters of the irreducible representations of C, 
and we refer to it henceforth as a character table. 

Let x°(A) be the entry of the character table in the row labelled A and 
column labelled a. The orthogonality relationship for characters gives 


Dd x"(A)x*(A) = 2"S08 
ACC, 
Dd x*(A)x(B) = 2"b45 
acc, 
where 6 is the usual Kronecker symbol. In particular 
DX x'(A)x'(4) = 2 x4) =0, BHI 
ACCE ACCE 


Since each x°(A) is +1 or —1, these must occur in equal numbers in any 
column 6 # J. This implies that each column except J. of the modular 
representation table contains 2° ones, a fact used earlier. 

Every matrix representation of C;, can be reduced to its irreducible 
components. If the trace of the matrix representing the element A in an 
arbitrary matrix representation of C; is x(A), then this representation 
contains the irreducible representation having label 8 in the character 
table dg times where 


230 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 


= 

2" sec 
Every (n, k)-alphabet furnishes us with a matrix representation of C; 

by means of (13) and the procedure outlined below (13). The trace x(A) 


of the matrix representing the element A of C; is related to the weight 
of the letter by 


dg = x(A)x?(A) (14) 


x(A) = n — 2w(A) (15) 


Equations (14) and (15) permit us to compute from the weights of an 
(n, k)-alphabet what irreducible representations are present in the alpha- 
bet and how many times each is contained. It is assumed here that the 
given alphabet has been made isomorphic to C;, and that the weights are 
labelled by elements of C; . 

Consider the converse problem. Given a set of numbers wi, we, °°: , 
wk that satisfy Propositions 6 and 7. From these we can compute 
quantities x; = n — 2w; as in (15). It is clear that the given w’s will 
constitute the weights of an (n, k)-alphabet if and only if the 2° x; can 
be labelled with elements of C; so that the 2" sums (14) (8 ranges over 
all elements of C,) are non-negative integers. The integers dg tell what 
representations to choose to construct an (n, k)-alphabet with the given 
weights wy . 


2.10 CONSTRUCTION OF BEST ALPHABETS 


A great many different techniques were used to construct the group 
alphabets listed in Tables IT and III and to show that for each n and k 
there are no group alphabets with smaller probability of error. Space 
prohibits the exhibition of proofs for all the alphabets listed. We content 
ourselves here with a sample argument and treat the case n = 10, k = 
4 in detail. 

According to (2) there are N(10, 4) = 53,748,987 different (10, 4)- 
alphabets. We now show that none is better than the one given in Table 
III. The letters of this alphabet and weights of the letters are 


I 
167810 
267910 
35689 10 
45789 10 
1289 
13579 


oR & O Or or © 
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14569 
23578 
24568. 
3467 
123579 
1245710 
1348 10 
234910 
12346789 


oOo o es He or or or 


The notation is that of Section 2.1. By actually forming the standard 
array of this alphabet, it is verified that 


a = uA aQay= 10, a= 39, ag = T4. 


Table II shows & ) = 45, whereas a2 = 389, so the given alphabet 


does not correct all possible double errors. In the standard array for the 
alphabet, 39 coset leaders are of weight 2. Of these 39 cosets, 33 have 
only one element of weight 2; the remaining 6 cosets each contain two 
elements of weight 2. This is due to the two elements of weight 4 in the 
given group, namely 1289 and 3467. A portion of the standard array 
that demonstrates these points is 


I 1289 3467 
12 89 

18 29 

19 28 : 

34 . 67 
36 47 


In order to have a smaller probability of error than the exhibited 
alphabet, it is necessary that a (10, 4)-alphabet have an a. > 39. We 
proceed to show that this is impossible by consideration of the weights 
of the letters of possible (10, 4)-alphabets. 

We first show that every (10, 4)-alphabet must have at least one ele- 
ment (other than the identity, J) of weight less than 5. By Propositions 
6 and 7, Section 2.8, the weights must form a partition of 10-8 = 80 into 
15 positive parts. If the weights are all even, at least two must be less 
than6 since 14-6 = 84 > 80. If eight of the weights are odd, we see from 
8-5 + 7-6 = 82 > 80 that at least one weight must be less than 5. 
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An alphabet with one or more elements of weight 1 must have an 
a2 < 36, for there are nine elements of weight 2 which cannot possibly 
be coset leaders. To see this, suppose (without loss of generality) that 
the alphabet contains the letter 1. The elements 12, 18, 14, --- 1 10 can- 
not possibly be coset leaders since the product of any one of them with 
the letter 1 yields an element of weight 1. 

An alphabet with one or more elements of weight 2 must have an 
a. < 37. Suppose for example, the alphabet contained the letter 12. 
Then 13 and 23 must be in the same coset, 14 and 24 must be in the 
same coset, -:- , 1 10 and 2 10 must be in the same coset. There are at 
least eight elements of weight two which are not coset leaders. 

Each element of weight 3 in the alphabet prevents three elements of 
weight 2 from being coset leaders. For example, if the alphabet contains 
123, then 12, 18, and 23 cannot be coset leaders. We say that the three 
elements of weight 2 are ‘‘blocked”’ by the letter of weight 3. Suppose an 
alphabet contains at least three letters of weight three. There are several 
cases: (A) if three letters have no numerals in common, e.g., 128, 456, 
789, then nine distinct elements of weight 2 are blocked and a2 S 36; 
(B) if no two of the letters have more than a single numeral in common, 
e.g., 123, 345, 789, then again nine elements of weight 2 are blocked and 
a2, < 36; and (C) if two of the letters of weight 3 have two numerals in 
common, e.g., 128, 234, then their product is a letter of weight 2 and by 
the preceding paragraph a2 S 37. If an alphabet contains exactly two 
elements of weight 3 and no elements of weight 2, the elements of weight 
3 block six elements of weight 2 and a2 S 39. 

The preceding argument shows that to be better than the exhibited 
alphabet a (10, 4)-alphabet with letters of weight 3 must have just one 
such letter. A similar argument (omitted here) shows that to be better 
than the exhibited alphabet, a (10, 4)-alphabet cannot contain more 
than one element of weight 4. Furthermore, it is easily seen that an 
alphabet containing one element of weight 3 and one element of weight 
4 must have an az S 39. 

The only new contenders for best (10, 4)-alphabet are, therefore, 
alphabets with a single letter other than J of weight less than 5, and this 
letter must have weight 3 or 4. Application of Propositions 6 and 7 show 
that the only possible weights for alphabets of this sort are: 356’ and 
5°46° where 5’ means seven letters of weight 5, etc. We next show that 
there do not exist (10, 4)-alphabets having these weights. 

Consider first the suggested alphabet with weights 35’6". As explained 
in Section 2.9, from such an alphabet we can construct a matrix repre- 
sentation of C, having the character x(J) = 10, one matrix of trace 4, 
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seven of trace 0 and seven of trace —2. The latter seven matrices cor- 
respond to elements of even weight and together with J must represent 
a subgroup of order 8. We associate them with the subgroup generated 
by the elements 2, 3, and 4. We have therefore 


x7) = 10, x(2) = x(3) = x(4) = x(23) 
= x(24) = x(84) = x(284) = —2. 


Examination of the symmetries involved shows that it doesn’t matter 
how the remaining x; are associated with the remaining group elements. 
We take, for example 


x1) = 4, x12) = x(18) = x14) = x(123) 
= x(124) = x(134) = x(1234) = 0. 


Now form the sum shown in equation (14) with 8 = 1234 (i.e., with the 
character x" obtained from column 1234 of the Table VI by means 
of substitution (13). There results diz3, = 14 which is impossible. There- 
fore there does not exist a (10, 4)-alphabet with weights 35’6’. 

The weights 5°46° correspond to a representation of C, with character 
x(I) = 10, 0°, 2, (—2)°. We take the subgroup of elements of even weight 
to be generated by 2, 3, and 4. Except for the identity, it is clearly im- 
material to which of these elements we assign the character 2. We make 
the following assignment: x(J) = 10, x(2) = 2, x(8) = x(4) = x(23) = 
x(24) = x(84) = x(234) = —2, — x(1) = x(12) = x(18) = x(14) = 
x(123) = x(124) = x(134) = x(1234) = 0. The use of equation (14) 
shows that d,; = 14 which is impossible. 

It follows that of the 53,743,987 (10, 4)-alphabets, none is better than 
the one listed on Table ITI. 

Not all the entries of Table III were established in the manner just 
demonstrated for the (10, 4)-alphabet. In many cases the search for a 
best alphabet was narrowed down to a few alphabets by simple argu- 
ments. The standard arrays for the alphabets were constructed and the 
best alphabet chosen. For large n the labor in making such a table can 
be considerable and the operations involved are highly liable to error 
when performed by hand. 

I am deeply indebted to V. M. Wolontis who programmed the IBM 
CPC computer to determine the a’s of a given alphabet and who pa- 
tiently ran off many such alphabets in course of the construction of 
Tables II and III. I am also indebted to Mrs. D. R. Fursdon who eval- 
uated many of the smaller alphabets by hand. 
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